Gene Tery_3098 details

Gene Information

Locus tag: Tery_3098
Symbol:
ID: 4244189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4749656
End bp: 4750912
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 27%
IMG OID: 638108113
Product: capsular polysaccharide biosynthesis protein-like
Protein accession: YP_722706
Protein GI: 113476645
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4421] Capsular polysaccharide biosynthesis protein
TIGRFAM ID:
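
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4749656
end_bp = 4750912
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: coordinates are 1-based and inclusive on both ends.
computed_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = computed_length // 3
computed_protein_length = codons - 1

print(computed_length, computed_protein_length)  # 1257 418
```

Under these assumptions the computed values agree with the reported gene and protein lengths.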


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAAAC CTCAATTTAA TCGTGATTTA CTTAATGAAG AAAGACAAAA AGAATTAAAA 
GAATTAAAAA CTTATTTAAT TAATCATCCT GATCTACTTT CAATACCTGA TAATAATGAT
CACAATAATT TCTTAATTTT TTTATCTAAA ATATTAAATA CTATTCCCTT ATTTCGACTA
TTAAAACTTA GTCAAACTAT TCACCCAACA AAAATATTAG GTAATGGTGT AGAAAATTAT
CAAGTATTAT TTCCTGCTGA AAAAATAGAA TTTGGTGTTA ATGATATAGA TCGAGAATTT
CTCCAAAAAT GTCTATATTA TAACAATGAT TATTTTGAGC GATCACCTAT ATTTATTTGT
GAGATAATTC CGGCTTATAT TCACATTGGT TTAGGCACTA TTTGTACTAA AGATTTTCAA
GTAATAGTTG ATTCTGGAAT GCAATATAGA TTAACCATAG CCAGATGGAG ACTTAACTAT
CTTAACCGTC TTAAACTTTT ATTTGCTAAA CAGTTATCAG TAAAATCCGC TTATATCGTT
TCTTTATCTG CGGATAATTT TTGGCATTTT TTATTTGATT GTTTACCTCG AATTTATTCT
TTAATTTTAG CTAAATATAC AGATAAATTA ACTATATTAA TACCTGATTC TTTACGCAGT
TCATTTCGAG AGTTATTAAA GTATCTACTT CCTGAAAATT TTGAAATTAT GTATATCGAA
AATGGAACAT GGGTAAGAGT TGAACATTTA GTAATGCCCT CTTATGTTAG CCGTTGTGAA
AATGGATTTT TACCTCCAGA ATACTACGAA TATATTAAAA ATTGTGTCTT TGATAAACTT
AATCTTACAC CAGTGGAAAA ACTTACTGAA AGAATTTATA TCTCCCGTAG AAATGCCAAA
CATCGTCGCG TATTAAATGA AGAAAAACTA ATTCAATATT TAGAAAAATA TAATTTTAAA
ACAGTTTTTC TAGAAGATAT GTCATTTCCA GAACAAGTAG AACTATTTAC TAAAGTAGAA
ATGATTGTAG CTCCTCATGG TGCAGGTTTA GGTATAACTT TGTTCTCAGG AAAAATTAAA
ATATTAGTTC TTTATCCAGA AACTTCACCT ACTCCTTTTT TCTTTACTCA ATTTAAAGGA
CTTGGACAAA AACATTATTT TATTACCCAC GATCAATTTG ATGAAAATGC TGATTTTGAA
GTTGATATGG AGGAATTTCA GGAAATATTT GATAAAATGA TCTATGAGGC TGTTTGA
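
The gene sequence is laid out in blocks of ten bases separated by spaces, so it needs to be cleaned before any analysis. The sketch below shows one way to strip that layout and compute a GC percentage that could be compared with the "GC content" field of the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTTCAAAC CTCAATTTAA TCGTGATTTA CTTAATGAAG AAAGACAAAA AGAATTAAAA"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this first line only
```

To check the whole gene, pass the full multi-line sequence to `clean_sequence` in place of `first_line`.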
 
Protein sequence
MFKPQFNRDL LNEERQKELK ELKTYLINHP DLLSIPDNND HNNFLIFLSK ILNTIPLFRL 
LKLSQTIHPT KILGNGVENY QVLFPAEKIE FGVNDIDREF LQKCLYYNND YFERSPIFIC
EIIPAYIHIG LGTICTKDFQ VIVDSGMQYR LTIARWRLNY LNRLKLLFAK QLSVKSAYIV
SLSADNFWHF LFDCLPRIYS LILAKYTDKL TILIPDSLRS SFRELLKYLL PENFEIMYIE
NGTWVRVEHL VMPSYVSRCE NGFLPPEYYE YIKNCVFDKL NLTPVEKLTE RIYISRRNAK
HRRVLNEEKL IQYLEKYNFK TVFLEDMSFP EQVELFTKVE MIVAPHGAGL GITLFSGKIK
ILVLYPETSP TPFFFTQFKG LGQKHYFITH DQFDENADFE VDMEEFQEIF DKMIYEAV
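
The protein sequence can be related back to the gene sequence by translation. The sketch below is a minimal illustration using only the Python standard library. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two tables differ in which start codons are permitted), and it stops at the first stop codon. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the ten-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGTTCAAAC CTCAATTTAA TCGTGATTTA"))  # MFKPQFNRDL
```

The ten residues produced match the first block of the protein sequence above. Translating the full gene sequence the same way gives a string that can be compared with the listed protein.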