Gene Tery_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1785 
Symbol 
ID4243767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2726086 
End bp2727363 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content32% 
IMG OID638106909 
Producthypothetical protein 
Protein accessionYP_721517 
Protein GI113475456 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000770102 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGCA ATCAATTTTA TCCCGATAAA TCTGTTCCTC CAGAAAAATT TGTAGGTAGA 
ACATCTGAAC TATCCACGAT TTTTGACAAA ATTAATAGTC GCGATCATGT TGCTATTTTT
GGTAGTTCTG GTATGGGTAA AACATCTTTG CTTCAATATA TAGAAAACTC TAAATTTTGG
GAAGAAAGAG ACTTAGATTT TTCAGAAGCT TTGATTGTTT ATCACAACTG CGAGCTTTCA
GTTATAGATA GTTTTTGGCA AGAAGTCCTC AGAACATTAA TAGATAAAGC TACAGGTGAT
CAAGATTTAG TGAGTAAGAT TAATGCTTTA TTAGGGTTGG AGAAAATAGA AATAACAGAT
ATACGGGAGC TTCTCAGAGA GATTGGGAAA AGAGGTAAGT TTTTATTATT ATTATTAGAT
GACTATCATA GAATACTTGG TACACGAGAA GAGTATCCGG AAAACCAGGG AAAAAAATCT
AAAAAAGTGC TGACTTTTTT AAGTGAGTTG CGTAACCTAG CAGTTCATAA TAGAGAAGGT
CAATATTTCT CAACTATTGT TGCTACGTTT CAAAAGTTGC ATGAACTAGG TCCAACAATT
GTTCCTGGTG GTTCTCCTTG GTATAATCAT TATGCCTATC TACCTTTAAA ACCTTTTTGT
AAAAGCGATA TTGAGGGTCA TTTTTTTAAT CGCGATAGTC ATTTTTTCAT TTCAGATGCT
CCGAAAGAAG AAGTTTTAAA AATGACTGGT GGGTATCCAG CGTTACTTCA GTTTACAGGT
TATATATTTT CTCGCTTGGA ACCAGTTAAT GTTGATACTC TGAATACAAT GTTAAAAAAC
GATGCTGATA GAATTTTTCA GGATGTCTGG AACAATTTTG AAAAAAATGA GCAAGAAATT
TTGCAGTTAA TTTTAATTGA TAAATATAAG GGTAAATTCA GGGAAATTTC TTATTCTATT
GCTGGCATAG AAAAAGAGTT TATTCGCAAT ATTAGCATAT TGAAGAGTCT TGAAGAAAAA
GGATTTATCA GTCAGGTTAA ACAAGCAAAT AAATATAGTT TTACTTCTTC TTTAATGGAA
GATTTTATTG GTGATCAACT TGCAGAAAAA AATGTTTCAA ACGCTAAAGA CCGCAAGATA
GTGATTAATT TATTTATTAT CAAGATTACT CTTGGACTAT GGAAGAAAGT TAAAGAAAAA
ATACAGCCTG TTACTAAGTT CATATCACCT CTTGCTAAAA TCATCGATTT AATCGCTAAC
AAAATAGAAG GCAAATAA
 
Protein sequence
MPRNQFYPDK SVPPEKFVGR TSELSTIFDK INSRDHVAIF GSSGMGKTSL LQYIENSKFW 
EERDLDFSEA LIVYHNCELS VIDSFWQEVL RTLIDKATGD QDLVSKINAL LGLEKIEITD
IRELLREIGK RGKFLLLLLD DYHRILGTRE EYPENQGKKS KKVLTFLSEL RNLAVHNREG
QYFSTIVATF QKLHELGPTI VPGGSPWYNH YAYLPLKPFC KSDIEGHFFN RDSHFFISDA
PKEEVLKMTG GYPALLQFTG YIFSRLEPVN VDTLNTMLKN DADRIFQDVW NNFEKNEQEI
LQLILIDKYK GKFREISYSI AGIEKEFIRN ISILKSLEEK GFISQVKQAN KYSFTSSLME
DFIGDQLAEK NVSNAKDRKI VINLFIIKIT LGLWKKVKEK IQPVTKFISP LAKIIDLIAN
KIEGK