Gene Tery_2689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2689 
Symbol 
ID4245184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4159429 
End bp4161774 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content38% 
IMG OID638107752 
Producthypothetical protein 
Protein accessionYP_722351 
Protein GI113476290 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.36496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00230732 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACTAATA AAGCAATTAC TATAGGTGTT CAGAAATATC AGTTTTTTTC TCCTCTGAAA 
TATGCAGCTA ATGATGCTGA AAAGATGAGG AATTTTTTGC TGGAAGAAGC AGGTTTTGAT
GAGGTTCTTT ACTACTCAGA TTATTCTCCA GAGATTAATG GTGATTATAC TCGCCCAACT
CGTTCTAATT TAGAATTCTT GTTAGAAAAT CAGTTTAAAG AACCATTTAT GGGTATTGGT
GATAATTTTT GGTTTTTTTT TAGTGGTCAT GGTCTTAGAG AGAATGGTAT TGATTATCTC
ATTCCTGTTG ATGGTTATAA AAATGTTCAA AAAAGTGGAA TTTCTGTTAA TTATATTATT
CAACAACTTC AAAAATGTGG AGCCGATAAT GTAGTTTTGA TATTGGATGC TTGTAGAGAT
GAGGGTGATG CAAGAAGGGG AGGTAAAGGA ATAGGAGAAC AAACAGAACT GGAAGCTATT
GAAAAGGGAG TAATTACTAT TTTTTCTTGT AGTCCTAATG AATATTCTTG GGAATTAGAA
GAATTTCAAC AAGGAGCTTT TACTTATGCA TTATTAGAAG GGTTAGGTAG TAAAGGTCAA
AAAGCAACTG TAGAAAAACT GAATGATTAC CTAAAATATA GGGTGAAAGA ATTATCTCAA
GATAAGGGAA AACAAACTCC TCGTATTACT ATTGATCCTC TGGAAAAATC TCATTTAATT
TTGATGCCAA AGTATGCAAC TTTAGCTGAT ATTGCAACTT TAAAACTTGA TGCATTTAGG
GCACAAAGCA AAAGAGAATT TGGAAAGGCT AAGTCTCTTT GGAGAATGGT ACTTAATGCG
GCTCAAGGTC CTGATGAAGA TGCAATAGAA GCTCTGAATG AGATAGCTAT TCAGGAAAAA
TTGGGAGATT TTAGTCAGTT TCCATTTAGT CCAAAACCAG AGAATTTTGA GAATTTACCT
ACTTCAGAAA CTCAACCACC AAAGTCAACA GAATTATTAG AAAGTTTAAA TATTCAAACT
CCATCAAAAC CACAAGGAAA GCCACAAGGA AAGCCACAAG GAAGGTCGAA ACCAAAGTCG
GAACCAAAGC CAAAAGCAAA AACTCCAACT AGATCTATAC CTAAAACCCA ACTCCCAGGC
AAAAAAACCA GAAGGCAAAT ATTGATATTA GGAGGTCTAG CCGGAAGCGG GTTTGTGGGG
ACGGTTTTAA CTCAAATATT TTTCAAGGAA CCATCTACAG AAAATATTTC TAGTTCTGAC
CAAGAAGCAC TTTTAGAACC AGCAGATATT TCTACTCCAC CACAACAAAA AACTGTTACC
CAGGAGTTTA CCACTGTTAA ACTGAACAAC ACAGGAGAAA TAATAAGCCG CATATTTTTG
ATATTAGGAG GTTTAGCCGG AAGCGGGTTT GTGGGGACGG TTTTAACTCA AAGATTTCTC
AAGGAACCAT CTACAGAAAA TATTTCTACT TCTGACCAAG AAGCACTTTT AGAACCAGCA
GATATTTCTA CTCCACCACA ACAAAAAACT GTTACCCAGG AGTTTACCAC TGTTAAACTG
AACAACACAG GAAAAATAAT AAGCCGCACT AAAGGTAAAG CAGAAGTAAT GACAGAAAAC
CTGGGTAATA GAGTTTCTCT AGAAATGGTA AAAATTCCCG GAGGTAGGTT TTTGATGGGG
TCTCCAGAGA CGGAAGCAGG AAGACGTGAT AACGAAGATC CGCAACATTA TGTAGATGTG
CCGGAATTTT TCATGGGAAA GTATGTAGTT ACTCAAGCAC AATGGCAAGC AGTTATGGGA
AATAACCCTG CTAAGTTTAA AGGTGCAAGT CGTCCTGTGG AAAGAGTAAG TTGGAATGAC
GCGATAAAAT TTTGTCAGAA ACTCTCACAA ATAACAGGAA GAAAATATAG TTTGCCCAGT
GAGAGTCAAT GGGAATATGC TTGTCGAGCC GGAACAACAA CACCATTTTA TTTTGGAGAG
ACTATAACAC CTGAGTTAGT TAACTATGAT GGCAACTACA CTTACGGTAA TGCGCCAAAA
GGAATATATA GAAAAGAAAC AACAGATGTG GGAATTTTTC CACCCAATAG TTTTGGTTTG
TACGATATGC ACGGGAATGT TTGGGAATGG TGTGCTGATG AATGGCATGA TAACTATGAT
GGTGCGCCTA CAGATGGCAG TGTTTGGCTA AATGGAAATA AAGCTCGATC ACCGCTGCGG
GGCGGTTCTT GGAGCAACAA TCCTCTTTTT TGCCGTTCTG CGGTTCGCCT CTACTATAAT
AGGCGCGACG ACCACAGCCT CACTTTTGGT TTTCGTCTTG TCTGCGATGG CGGGAGAACT
CTTTAA
 
Protein sequence
MTNKAITIGV QKYQFFSPLK YAANDAEKMR NFLLEEAGFD EVLYYSDYSP EINGDYTRPT 
RSNLEFLLEN QFKEPFMGIG DNFWFFFSGH GLRENGIDYL IPVDGYKNVQ KSGISVNYII
QQLQKCGADN VVLILDACRD EGDARRGGKG IGEQTELEAI EKGVITIFSC SPNEYSWELE
EFQQGAFTYA LLEGLGSKGQ KATVEKLNDY LKYRVKELSQ DKGKQTPRIT IDPLEKSHLI
LMPKYATLAD IATLKLDAFR AQSKREFGKA KSLWRMVLNA AQGPDEDAIE ALNEIAIQEK
LGDFSQFPFS PKPENFENLP TSETQPPKST ELLESLNIQT PSKPQGKPQG KPQGRSKPKS
EPKPKAKTPT RSIPKTQLPG KKTRRQILIL GGLAGSGFVG TVLTQIFFKE PSTENISSSD
QEALLEPADI STPPQQKTVT QEFTTVKLNN TGEIISRIFL ILGGLAGSGF VGTVLTQRFL
KEPSTENIST SDQEALLEPA DISTPPQQKT VTQEFTTVKL NNTGKIISRT KGKAEVMTEN
LGNRVSLEMV KIPGGRFLMG SPETEAGRRD NEDPQHYVDV PEFFMGKYVV TQAQWQAVMG
NNPAKFKGAS RPVERVSWND AIKFCQKLSQ ITGRKYSLPS ESQWEYACRA GTTTPFYFGE
TITPELVNYD GNYTYGNAPK GIYRKETTDV GIFPPNSFGL YDMHGNVWEW CADEWHDNYD
GAPTDGSVWL NGNKARSPLR GGSWSNNPLF CRSAVRLYYN RRDDHSLTFG FRLVCDGGRT
L