Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_3081 |
Symbol | |
ID | 7316011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 3227252 |
End bp | 3228232 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643617980 |
Product | NMT1/THI5 like domain protein |
Protein accession | YP_002515137 |
Protein GI | 220936238 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAAAC CCCTCAAGCG ACTCTTCGGT GCCCTGGTGG GCATCACCAT GCTGGGCATG TCCGCCCAGG CCCTGTCCGA ACCCCTGAAG ATCGGCTACA GCGACTGGCC CGGCTGGGTC GCCTGGGAAG TGGCCATCGA GAAGGACTGG TTCAAGGAAG AAGGGGTGGA CGTGAAGTTC GAATGGTTCG ACTACGTGGC CTCCATGGAC GCCTTCGCCG CCGGCCAGCT GGATGCCGTG GCCATGACCA ATGGCGATGC CCTGGTCACC GGCGCCACCG GCGCGCGCAA CGTGATGATC ATCATCAACG ACTACTCCAA CGGCAATGAC ATGATCATCG CCCGGCGCGG CATCAGCTCC GTGGCCGATC TGAAGGGCAG GAAGATCGGC GTGGAGATAG GCTTCGTCTC CCACCTGCTG CTGCTCAACG CCCTGGAGAA GAACGGCATG AGCGAATCGG ACGTGGAGCT GATCAACGTG CCCACCAATG AGACGCCCCA GGTGCTCGCC TCCGGCAACG TGGACGCCAT CGTCGCCTGG CAGCCCAGCT CCGGCCAGGC CCTGAACCTG GTGCCCGGCT CCACCGCCAT CTACACCAGC GCCGACGAAC CCGGTCTGAT CTACGACGTG CTGGCGGTCT CCCCCACCAG CCTGGCCGCG AACCGTGACG CCTGGATCAA GGTGGCGCGG GTCTGGTACC GGGCGGTGGA TTACATCCAG GATCCGGCCA CCCGCGCCGA CGCGGTGCGC ATCATGGCCG CGCGCGTGGG CATCCCGCCC GCCGACTACG AGGGCTTCAT CGAGGGCACC AAGATCCTGA CCCGTGAAGA GGCCATGGAA TTCTTCACGA AGGGCGACGG CTTCACCTCC CTGTACGGCT CCTCGAAGAT CGCCGACGAG TTCAACGTCG CCAATGACGT GTATACCGAA CCGCAGCCCG TCGAGGACTA CATCGACGCC AGCATCACCG GCGCCCTGTA A
|
Protein sequence | MFKPLKRLFG ALVGITMLGM SAQALSEPLK IGYSDWPGWV AWEVAIEKDW FKEEGVDVKF EWFDYVASMD AFAAGQLDAV AMTNGDALVT GATGARNVMI IINDYSNGND MIIARRGISS VADLKGRKIG VEIGFVSHLL LLNALEKNGM SESDVELINV PTNETPQVLA SGNVDAIVAW QPSSGQALNL VPGSTAIYTS ADEPGLIYDV LAVSPTSLAA NRDAWIKVAR VWYRAVDYIQ DPATRADAVR IMAARVGIPP ADYEGFIEGT KILTREEAME FFTKGDGFTS LYGSSKIADE FNVANDVYTE PQPVEDYIDA SITGAL
|
| |