Gene Information |
Locus tag | Tgr7_1470 |
Symbol | |
ID | 7317081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1574275 |
End bp | 1575315 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643616361 |
Product | immunogenic protein (bcsp31-3) |
Protein accession | YP_002513541 |
Protein GI | 220934642 |
COG category | [R] General function prediction only |
COG ID | [COG2358] TRAP-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTCT CCAAGACCCG TTTCCTCAGT ACCCTGGTGG CCATCGGTGC CACCTTTGCC CTGTCGCTGG CGGCCATCCA GCCCGCCGAG GCCCAGCGTC AGACCATCCG TTTCGCCACA TCCAACGTCG GTTCCTACGG CTATGCCGTG GGTAACGTGA TCTCCGAGAT CCTGCAGCAG AACCTGGGCC GCGGTTATGC CGTGGTGGTG CAGCCCTATC CCTCCACCTC CGGCGCCATG CGCTCGGTGA TGGATGGCGA CGGCGAGTTC GGCTACACCG CTGACGTGGG CATGCGCGGC CTGTATGACG GCGAGCGTCC CTACGACAAC TACACCCCCC GCCGCGGCAT GCTGGTGCAC ACCCTGTACG TGTATCCCAT GGAGACCTTC ATGCTGGTGG CCGACCGCCG CAAGGGTGAG TTCAACTCCT ACAAGGACTT CGACGGTGCC CCGGGCTACT ACACCCCTGC CGGCTTCATG AACTGGCTCA ACATGCGTCG CGTGTTCGCC GCTCTGGGCT ATGAGTTCAA CCACGTGGAG ATCGACAACA CCACCGTGGC GGATGCCTTC GAGGCCAACA CCATCGCCGG TTCCGCGGGG TATACCACCG CCGGTGTCTC CCTGCCCACC TACTGGCGTG AGGCCGAGCT GCGCGCCAAC CTGGCCGCGG TGAACCCCTC CGAGGAGGAG ATGGAGAAGC TGCGTGCCGC GGGCCTGAAC CCGGTGCCTG TGGATCCCTC CAAGGCCTTC AGCCAGGAAC TGGGTGTGGA CACGATCTAC GCAGTGCCCA TCTACTTCGC CTACAACGTG CGCGCGGATA TGGATGCTGA CCTGATCAAG AACATCCTGG ACATCCTGTA CAAGAACAAG GACAAGCTGG TGGAAGGTGA TGCAGGCTTC GGTCCTCTGG CTGCCGACTT CGTGGGCACC CAGGCCGCCG GCGTGGCTGC CAACCCGGAT ATCCCGGTGC ATCCCGGCCT GGCCGCGTTC CTGAAGGAAC ACGGCGCCTG GAATGACAGC TGGACCATCG CGGGCCAGTA A
|
Protein sequence | MIFSKTRFLS TLVAIGATFA LSLAAIQPAE AQRQTIRFAT SNVGSYGYAV GNVISEILQQ NLGRGYAVVV QPYPSTSGAM RSVMDGDGEF GYTADVGMRG LYDGERPYDN YTPRRGMLVH TLYVYPMETF MLVADRRKGE FNSYKDFDGA PGYYTPAGFM NWLNMRRVFA ALGYEFNHVE IDNTTVADAF EANTIAGSAG YTTAGVSLPT YWREAELRAN LAAVNPSEEE MEKLRAAGLN PVPVDPSKAF SQELGVDTIY AVPIYFAYNV RADMDADLIK NILDILYKNK DKLVEGDAGF GPLAADFVGT QAAGVAANPD IPVHPGLAAF LKEHGAWNDS WTIAGQ
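
The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


def gc_percent(blocked_seq: str) -> float:
    """GC percentage of a sequence written in space-separated blocks."""
    seq = "".join(blocked_seq.split()).upper()  # remove the block spacing
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(1574275, 1575315)
residues = protein_length(length)
print(length, residues)  # prints "1041 346"
```

Under these assumptions the results agree with the Gene Length (1041 bp) and Protein Length (346 aa) fields. Passing the full Gene sequence field to `gc_percent` should give a value comparable to the listed GC content, although the rounding used by the source database is not stated.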