Gene Tery_3593 details

Gene Information

Locus tag: Tery_3593
Symbol:
ID: 4244226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5529266
End bp: 5532559
Gene Length: 3294 bp
Protein Length: 1097 aa
Translation table: 11
GC content: 37%
IMG OID: 638108555
Product: PEP-utilising enzyme, mobile region
Protein accession: YP_723144
Protein GI: 113477083
COG category: [G] Carbohydrate transport and metabolism; [S] Function unknown
COG ID: [COG0344] Predicted membrane protein; [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase
TIGRFAM ID:
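
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS length includes a terminal stop codon; both are assumptions about how this record was produced, and the function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
start_bp, end_bp = 5529266, 5532559
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 3294
print(protein_length_aa(length))  # 1097
```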


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCTT ACCTTGTGGC GCACATTTGT GGATTGAAAC TTACAAGCCA GTGCAGTTTG 
TGGATACTCG TTTTAAACCT AGGAGTGCTA GGCGACCGTA CAAAGATATC TGTTGATATG
CAAATATTAA TTCCAGCATT ATTTCTAACT GTCTGCTTTG GTCTAGGCTC TCTTCCTCTA
ACTAATTTAA TAGTTAAAAA TCTGGGAAAT ATTAACCTAA CAAAAGTTGG TACAGGTAAC
CCTAGCGTGG CAGCCGCATT TGTTCACGCA CAAAAATCTG TGGCAATATT TGCAGTCTTA
GCAGAAATTA TTAGAGGTAT TACTCCTGTT TTAGTAGCAA AATTATTATT TCCAGAAATT
TCTACTTGGC AATTAGTAGG ATTAATATTT TTAGTGGCAG GTAGATATTT CATTGCTCAA
GGTGGCGGTG TTACTAATGC TAGTTGGGGC GTTTTGATTT ATTCTCCCGT TGTTGCATTG
GCTTCGGGAA TTACAGGTTT TTCAATATTA GTCATAGGTA AACAAGTCTT TCCTCACAAG
AATCAAAATA TTCAAAAATG GTCGGCTCGT TTGGGATGTC TCAGTAGTTT TTTCTGGGTT
TTATTCTTTC GCACCTCAGC ATCTTTATTA GAATTATTCG CAGCAGCAGG ATTAGCAATT
TTCTTAGTAG TAATAAACTT TTTCCAGAGT GATGATATGG CTTTACAAAA ACGACCTATA
TTTTCTCTCA ATAGTCAACT TGATGCTAAA ATTTGTGGAG AAAAAGCAGC TAGATTAGCT
TTACTAAAAA AATCCGGTTT TAATGTTCCT GAAGGTTTTG TTTTACCTGC AATTGAAAAT
GGAAATAGGA AATTGGTTGT GGAGAGTGAA ATGACAAAAG CTATTCCTAA TACTCGACTC
CCAACTGGCT TTTTCTCAGT AGGTAAAATT TTAGATAAGT TAGATAAAAT GTCTGGGAGT
TTTCCATTAA TTGTTCGTTC TTCAGCAGTA GGGGAAGATA GTGATAATAG TTCTGCTGCC
GGACAATATG AAACTATTTA TCCTGTGACA AATGAAACGG AATTATTAGA GGCAATTAAT
ATTTGTCGTC AGTCTTATTG GTTACCAGAA GCTATTGCTT ATCGGCAACA AAGAGAAATT
CCTGATGGGG AAATGGCGGT TTTGATTCAG CCTTATATTA TGAGTCAAGT TGCTGGAGTA
ATGTTTACTA GAAATCCTGT GGATGGTAGT GCAAAAATAA TTATTGAAGC TTTGCCTGGA
GGTGCAGCAA AAGTTGTGGG TGGGCGGTTG ACTCCCCTAC ATTTAGAAAT TGATAAAAAT
AGATTTCACG AAATTAAAAA TTCCACAGTC AAAGGTAAAA GTTATCCTTC TATTAATGAA
TATGCTGATT TTTTTACTAA ATTAGAAAAT CAGGATATTT TACTTCCAGA AATTATTCAA
GAATTGGTAA GTAAAGCAGA AGCTATTGAA GAATTTTTTC ATGGTTTGCC TCAAGATATT
GAATGGTGTT GGGATGGGGA AAAAATTTGG ATTTTGCAAA GTCGCCCAAT TACTAATCTG
AGACCTATTT GGACTCGAAC TATTGCAGCA GAAGTAATTC CAGGGGCTAT TCATCCTTTA
ACTTGGTCGA TAAATCGTCC TCTGACTTGT GGGGTTTGGG GAGAAATTTT TACTATTGTT
TTAGGGGAAA AGGTAGCAAA GTTAGACTTT ACTGAAACGG CAACTTTACT GGGTTCTCAT
GCTTATTTTA ATGCTACTTT GTTAGGGGAA ATTTTTAGAA TGATGGGATT GCCGGAACAG
GGTTTAGAGT TTTTATTACG AGGGCAAAAA ATGGGAAAAC CTCCTTTAGA AAAAGTCTGG
TCTGCTTTGC CTGGTTTATG GCGTTTGATT CAAAAAGAAA TGGCAATTGA TGACGAGTTT
GAGCGTGACT ATCATCAAAT TTTTGTTCCC GCACTACAAA GTTTGGAAAA TAATTTTCAA
CATAATTTTC CGGAAAATTC GTCTCAGTGT TTATCAGAAT TATTGGTTCA AGTAGAAAGA
ATTCAGGAAT GGTTAAAACC GATTACTTTT TACAATATTT TGGGTCCTAT AGGTTTAGGG
ATTAGGCGAT CGATCTTCGG TGTTTCTGAA GAATGGTTAC CTACTGATAC TGCTCCTGAA
ATTGTTTCAA TTCGAGAATT ACAAAGGTTG GCAGTTAAGT TGAAAATGGC AACTAAATCT
GATGTAGAAA TAGAGGCAGA ATTTCAGCAA AATCAGGAAT TACAATTGGA GTTTCAGCGA
TGGTTGAAAA GTTATGGTTA CTTGAGCGAG GTTGGTACGG ATATTGCTGT GGTAACTTGG
GGAGAAAAGC CGGATAATTT CCGCGATTTA TTATTTGCAA TGGCTCGGAA AGGTGAGAGA
GAAAAAGATA ATTTTTCGGT AAGGTCTTTA AATTTTTGGC AGAGTTGGCG ACTGGGAAAA
TGTAGAAAAC GTGCTAGAAC TAAGGGTAAA ATTGCTGAAG TTTATGGAAA ACTTTTGGCT
GATTTACGAC GGACGTTTTT GGCAATGGAA AATTATGGAT TAGAAGCAGG AATTTTTGAG
CAAAAAGGAG ATATTTTTTA TTTGGAATTT GCCGAAGTTA GGGAATGGAT TTTATCTACA
GTGGGTTTGG TCTCAGAAGG AAACCTAAAA AAGTTTAATA CAGAAGCTCC TGTAGGTTTG
GTCTCAGAAC AAAATCCAAC AAATAGTTTA GGAGTCTCGG AACAAAACAG AAAAGGATTT
AATGTAGTTT CTCCCCACAC TTCTCAAATT CCCCAAACTT CGCACAAGCA AGTGATTATC
AAGAATGAAA TAAGTTATCT CAATCTCAGA GAACTTATCC GTCAAAGGCA GGAACAGTTG
GAAAAAGATG GCGATCGCCT AGTTCCCTCT GTAGTTTATG GGAATGTATT ACCCCAAGAA
AGTCAGGAAA CTTTCTCTAG TGAGGAAAAT GTTTCTTTTT TACAAGGGAT ACCAGGGAGT
GTTGGGTGTG TGGAAGGTTA TATCAAGGTT TGTCGGAGTT TAGAAGTAAA TCTTACTGAG
GATGAAGGTT TAATTATAGT TGTGCCTTAT ACGGATGCTG GTTGGGCGCC TTTGTTATTG
AAGGCAAAGG GTATTATTGC TGAAGTGGGG GGGCAGTTAT CCCACGGAGC TATTATTGCG
CGGGAGTATG GCATTCCGGC AGTAATGAAT ATTTCTGGAG CGATGACTCG TTTACAGGAT
GGGCAGAAAG TTAGGGTAGA TGGGTTTCGG GGTACTGTGG AATTGTTATT TTAA
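
The GC content field reported for this gene can, in principle, be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown here; the helper name `gc_content` is hypothetical and is not part of any database tooling.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and newlines from the blocked layout) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# The first two blocks of the gene sequence, kept in their blocked layout.
print(gc_content("ATGAACCCTT ACCTTGTGGC"))  # 50.0
```

Passing the full gene sequence instead of the two-block excerpt gives the gene-wide value, which can then be compared against the rounded percentage in the record.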
 
Protein sequence
MNPYLVAHIC GLKLTSQCSL WILVLNLGVL GDRTKISVDM QILIPALFLT VCFGLGSLPL 
TNLIVKNLGN INLTKVGTGN PSVAAAFVHA QKSVAIFAVL AEIIRGITPV LVAKLLFPEI
STWQLVGLIF LVAGRYFIAQ GGGVTNASWG VLIYSPVVAL ASGITGFSIL VIGKQVFPHK
NQNIQKWSAR LGCLSSFFWV LFFRTSASLL ELFAAAGLAI FLVVINFFQS DDMALQKRPI
FSLNSQLDAK ICGEKAARLA LLKKSGFNVP EGFVLPAIEN GNRKLVVESE MTKAIPNTRL
PTGFFSVGKI LDKLDKMSGS FPLIVRSSAV GEDSDNSSAA GQYETIYPVT NETELLEAIN
ICRQSYWLPE AIAYRQQREI PDGEMAVLIQ PYIMSQVAGV MFTRNPVDGS AKIIIEALPG
GAAKVVGGRL TPLHLEIDKN RFHEIKNSTV KGKSYPSINE YADFFTKLEN QDILLPEIIQ
ELVSKAEAIE EFFHGLPQDI EWCWDGEKIW ILQSRPITNL RPIWTRTIAA EVIPGAIHPL
TWSINRPLTC GVWGEIFTIV LGEKVAKLDF TETATLLGSH AYFNATLLGE IFRMMGLPEQ
GLEFLLRGQK MGKPPLEKVW SALPGLWRLI QKEMAIDDEF ERDYHQIFVP ALQSLENNFQ
HNFPENSSQC LSELLVQVER IQEWLKPITF YNILGPIGLG IRRSIFGVSE EWLPTDTAPE
IVSIRELQRL AVKLKMATKS DVEIEAEFQQ NQELQLEFQR WLKSYGYLSE VGTDIAVVTW
GEKPDNFRDL LFAMARKGER EKDNFSVRSL NFWQSWRLGK CRKRARTKGK IAEVYGKLLA
DLRRTFLAME NYGLEAGIFE QKGDIFYLEF AEVREWILST VGLVSEGNLK KFNTEAPVGL
VSEQNPTNSL GVSEQNRKGF NVVSPHTSQI PQTSHKQVII KNEISYLNLR ELIRQRQEQL
EKDGDRLVPS VVYGNVLPQE SQETFSSEEN VSFLQGIPGS VGCVEGYIKV CRSLEVNLTE
DEGLIIVVPY TDAGWAPLLL KAKGIIAEVG GQLSHGAIIA REYGIPAVMN ISGAMTRLQD
GQKVRVDGFR GTVELLF
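
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It relies on table 11 sharing the standard codon-to-amino-acid assignments (the tables differ in permitted start codons, which this sketch does not model), and the function name `translate` is illustrative.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignments,
# which translation table 11 shares for sense and stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAACCCTT ACCTTGTGGC GCACATTTGT"))  # MNPYLVAHIC
# Last 9 bases of the gene sequence, ending in the TAA stop codon.
print(translate("TTATTTTAA"))  # LF
```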