Gene Information |
Locus tag | Tery_3098 |
Symbol | |
ID | 4244189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4749656 |
End bp | 4750912 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 638108113 |
Product | capsular polysaccharide biosynthesis protein-like |
Protein accession | YP_722706 |
Protein GI | 113476645 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4421] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | |
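The reported lengths can be cross-checked against the coordinates above. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the last codon is the stop codon


length = gene_length(4749656, 4750912)
print(length)                  # 1257
print(protein_length(length))  # 418
```

Both values agree with the Gene Length and Protein Length rows of this record.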
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTCAAAC CTCAATTTAA TCGTGATTTA CTTAATGAAG AAAGACAAAA AGAATTAAAA GAATTAAAAA CTTATTTAAT TAATCATCCT GATCTACTTT CAATACCTGA TAATAATGAT CACAATAATT TCTTAATTTT TTTATCTAAA ATATTAAATA CTATTCCCTT ATTTCGACTA TTAAAACTTA GTCAAACTAT TCACCCAACA AAAATATTAG GTAATGGTGT AGAAAATTAT CAAGTATTAT TTCCTGCTGA AAAAATAGAA TTTGGTGTTA ATGATATAGA TCGAGAATTT CTCCAAAAAT GTCTATATTA TAACAATGAT TATTTTGAGC GATCACCTAT ATTTATTTGT GAGATAATTC CGGCTTATAT TCACATTGGT TTAGGCACTA TTTGTACTAA AGATTTTCAA GTAATAGTTG ATTCTGGAAT GCAATATAGA TTAACCATAG CCAGATGGAG ACTTAACTAT CTTAACCGTC TTAAACTTTT ATTTGCTAAA CAGTTATCAG TAAAATCCGC TTATATCGTT TCTTTATCTG CGGATAATTT TTGGCATTTT TTATTTGATT GTTTACCTCG AATTTATTCT TTAATTTTAG CTAAATATAC AGATAAATTA ACTATATTAA TACCTGATTC TTTACGCAGT TCATTTCGAG AGTTATTAAA GTATCTACTT CCTGAAAATT TTGAAATTAT GTATATCGAA AATGGAACAT GGGTAAGAGT TGAACATTTA GTAATGCCCT CTTATGTTAG CCGTTGTGAA AATGGATTTT TACCTCCAGA ATACTACGAA TATATTAAAA ATTGTGTCTT TGATAAACTT AATCTTACAC CAGTGGAAAA ACTTACTGAA AGAATTTATA TCTCCCGTAG AAATGCCAAA CATCGTCGCG TATTAAATGA AGAAAAACTA ATTCAATATT TAGAAAAATA TAATTTTAAA ACAGTTTTTC TAGAAGATAT GTCATTTCCA GAACAAGTAG AACTATTTAC TAAAGTAGAA ATGATTGTAG CTCCTCATGG TGCAGGTTTA GGTATAACTT TGTTCTCAGG AAAAATTAAA ATATTAGTTC TTTATCCAGA AACTTCACCT ACTCCTTTTT TCTTTACTCA ATTTAAAGGA CTTGGACAAA AACATTATTT TATTACCCAC GATCAATTTG ATGAAAATGC TGATTTTGAA GTTGATATGG AGGAATTTCA GGAAATATTT GATAAAATGA TCTATGAGGC TGTTTGA
Protein sequence | MFKPQFNRDL LNEERQKELK ELKTYLINHP DLLSIPDNND HNNFLIFLSK ILNTIPLFRL LKLSQTIHPT KILGNGVENY QVLFPAEKIE FGVNDIDREF LQKCLYYNND YFERSPIFIC EIIPAYIHIG LGTICTKDFQ VIVDSGMQYR LTIARWRLNY LNRLKLLFAK QLSVKSAYIV SLSADNFWHF LFDCLPRIYS LILAKYTDKL TILIPDSLRS SFRELLKYLL PENFEIMYIE NGTWVRVEHL VMPSYVSRCE NGFLPPEYYE YIKNCVFDKL NLTPVEKLTE RIYISRRNAK HRRVLNEEKL IQYLEKYNFK TVFLEDMSFP EQVELFTKVE MIVAPHGAGL GITLFSGKIK ILVLYPETSP TPFFFTQFKG LGQKHYFITH DQFDENADFE VDMEEFQEIF DKMIYEAV
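The relationship between the two sequences and the GC content field can be checked with a short script. This is a sketch under stated assumptions, not the method the database uses: the helper names are hypothetical, the codon table is built from the standard amino acid assignments (which translation table 11 shares, differing only in its permitted start codons), and only the first three 10-base blocks of the gene sequence are used for brevity.

```python
# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = clean_sequence("ATGTTCAAAC CTCAATTTAA TCGTGATTTA")
print(translate(prefix))  # MFKPQFNRDL
print(round(100 * gc_content(prefix), 1))
```

Passing the full gene sequence to the same functions should allow the GC content and Protein sequence fields to be compared against the record.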