Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3355 |
Symbol | |
ID | 4243449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 5146012 |
End bp | 5147667 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638108339 |
Product | extracellular solute-binding protein |
Protein accession | YP_722930 |
Protein GI | 113476869 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.188345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAC ATAAATCATT TAATTCAATC AACCATTCCT GGCAGTCAAT AATTCAATTT TTTGGCTTAT TTTGTCTTTG CTGTTTCCTG GTCATTAGTT GTAGCCAACC TCAAAATAAC CCCGACAATA CTCTAACTAT AGAAACAAAT ACAAATCGCA TTACTATAGG CACAACTTTA AAACCTCGGA CTATCGACCC GGCAGATGCC TACGAAGTAA TATCTGGTAA CTTACTCCAT AACTTAGGCG ATCGCCTCTA CGGTTATAAG CTAGGAACAA TGGAACTTGT ACCATCACTA GCAACAGAAA TGCCAAAAAT TAGTGAAGAT GGCACAACAT ATACTATTCC CATTCGTCAA GGGGTGACAT TCCATGACGG TACTCCTTTC AACGCTGAAG CAATGGCCTT TTCTTTCAAA CGTTTTATTA AAAATAGTGG CCCACCTTCT TCTTTGTTAA CTAATACTAT TAAATCAGTA GAAGCTACGG GAGAATACCA ATTAACAATT AAGTTGAAAA AACCCTTTGC AGCTTTTACT TCTTTATTAA CATTTTCTGG TCTGTGTGCT GTTTCCCCAC AAGCTTATGA AATAGGTGAA AGTCAATTTA AACCTGATAC ATTTATTGGT ACTGGTCCCT ATAAATTAGC TGAGTATGGT ACTGATACTT TACGCTTAGA TGTATTTGAA AATTATTGGG GAGAAAAACC AAAAAATCAA GGAATTGACA TTCAAATATT TTCTAGTTCA GCTAACCTAT TTAATGCTTT TAAAACAGGG TCTATTGATG TAGCTTATTT TTCTCTAGAC ACTGACCAAA TTACTAATTT AGAAGCTGAA GCTATCCGTC AAGGATGGCA AGTAATTTCT ACAGATGGTA AGACAGTTAA TTATATGGTT TTAAATCTCA ATTTAGAACC ATTAAATAAC AAAGCTGTTA GACAAGCTTT AGCATCTATT ATTGATAGAA AATTACTAAA TGAACGAGTT TTACAAGGAA AAGCTGAACC AGTTTATAGT CTAATTCCTA AACAAATTAA TAGCTATAAA CCAGTGTTTA AAGAAAACTA TGGAGATGGA AATTTTGCTC AAGCTAAGGA GTTATTAAAA GAGGCAGGAT ATTCTCGAAA TAGCCCAGCC AAAATTGAAA TTTGGTATGC TGCAAATTCT ACTAAGAGGC AATTAACAGC TAGTACATTA AAAGCATATG TAGATCAAAA TTTAGAAGGT TTAATGGAGT TAGAATTGAA TAGCGTAGAA GCAGCTACTG CTTTTAATAA TTTAGATAAA GGAGTATATC AAACATTTAT TTTAGACTGG TATGGAGACT TTCTTGATGC GGATAATTAT ATCCAACCAT TTTTAGAATG TATCAAAGGT TCAAAAGAAA AAGGTTGTGA AGAAGGGGCT AGTCAATTTC AAGGTTCATT TTATTATAGT GATCGGATAA ATAAATTAAT CCAACAACAA CGTCAAGAGC AAAACCCAGA AAAGCGGCTA GCTATTTTTG TAGAAATTCA AGAATTATTA GCAGAAGATG TTCCTTTTAT CCCATTATGG TTAGATAAAG ATTATGTATT TGCTCAGAAA AACATCAGTG GGGTTAGTTT AGAACCTACT CAGCAATTTT CTTTCTTGAA AATTAATAAA TCATAA
|
Protein sequence | MEKHKSFNSI NHSWQSIIQF FGLFCLCCFL VISCSQPQNN PDNTLTIETN TNRITIGTTL KPRTIDPADA YEVISGNLLH NLGDRLYGYK LGTMELVPSL ATEMPKISED GTTYTIPIRQ GVTFHDGTPF NAEAMAFSFK RFIKNSGPPS SLLTNTIKSV EATGEYQLTI KLKKPFAAFT SLLTFSGLCA VSPQAYEIGE SQFKPDTFIG TGPYKLAEYG TDTLRLDVFE NYWGEKPKNQ GIDIQIFSSS ANLFNAFKTG SIDVAYFSLD TDQITNLEAE AIRQGWQVIS TDGKTVNYMV LNLNLEPLNN KAVRQALASI IDRKLLNERV LQGKAEPVYS LIPKQINSYK PVFKENYGDG NFAQAKELLK EAGYSRNSPA KIEIWYAANS TKRQLTASTL KAYVDQNLEG LMELELNSVE AATAFNNLDK GVYQTFILDW YGDFLDADNY IQPFLECIKG SKEKGCEEGA SQFQGSFYYS DRINKLIQQQ RQEQNPEKRL AIFVEIQELL AEDVPFIPLW LDKDYVFAQK NISGVSLEPT QQFSFLKINK S
|
| |