Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0487 |
Symbol | |
ID | 7317671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 518549 |
End bp | 520438 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643615370 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002512571 |
Protein GI | 220933672 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.200164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCA TCCCCCAGGA CTTCATCGCC AAGACCGCCC GGCTCTCCGA GGAGATCACC CGGCCGTTCC CCAATTCCCG CAAGATCTTC GTGCAGGGTT CGAGCCCGGA CATTCGCGTG CCCATGCGGG AGGTGAGCCA GGCGCCCACC CAGGCCTCCA TGGGGGCGGA GGAGAACCCG CCCATCACCG TCTACGACAC CTCCGGCCCC TACACGGACC CGGCCGCGTC CATCGACCTG ATGCAGGGTC TGCCGGCGCT GCGCGAGGCC TGGATCGAGT CGCGCCAGGA CACCGAGCGC CTGGACGGGC CCAGCTCCGA GTACGGCCGC GCCCGGGCCG CCGACCCGCA GCTGGCCGCG CTGCGCTTCG AGCACATCCG TGCGCCGCGC CGGGCGCGCC CTGGCCTGAA CGTCACCCAG ATGCACTACG CCCGCCGGGG CATCATCACT CCCGAGATGG AGTTCGTCGC CATTCGCGAG AACGCCCGCC TGGAGGAGAT GCGCGCCGAT CCCCGCTACG CCAAGCTGCT GCGCCAGCAC GCCGGCCAGC CCCTCGGCGC GAAGATCCCG GAGCTGATCA CGCCCGAGTT CGTGCGTGAC GAGGTGGCCG CGGGCCGGGC CATCATCCCC AGCAACATCA ACCACCCGGA ACTGGAGCCC ATGATCATCG GGCGCAATTT CCGGGTGAAG ATCAATACCA ACATCGGCAA CTCCGCGGTC ACCTCCAGCA TCGAGGAGGA GGTGGAGAAG CTGGTGTGGT CCTGCCGCTG GGGCGGCGAC ACCCTCATGG ACCTGTCCAC GGGCAAGAAC ATCCACGAGA CCCGTGAATG GATCCTGCGC AACAGCCCCG TGCCCATCGG CACCGTGCCC ATCTACCAGG CCCTGGAGAA GGTCAACGGC AAGCCCGAGG AGCTGACCTG GGAGCTGTTC CGGGACACCC TGATCGAGCA GGCGGAGCAG GGGGTGGACT ACTTCACCAT CCACGCCGGC GTGCTGCTGC GCTACGTGCC GCTCACCGCC GACCGGGTCA CGGGCATCGT CTCCCGGGGC GGCTCCATCA TGGCCAAGTG GTGCCTGGCC CATCACCGCG AGAACTTCCT CTACACCCAC TTCGAGGAGA TCTGCGAGAT CATGAAGGCC TACGACGTGG CCTTCAGCCT GGGCGACGGC CTGCGCCCAG GCTGTCTCGC GGATGCCAAC GACGCCGCCC AGTTCGGCGA ACTGGAGACC CTGGGTGAGC TGACCAAGAT CGCCTGGGAG CATGACGTGC AGGTGATGAT CGAGGGCCCG GGCCACGTGC CCCTGCACAT GGTCAAAGAG AACGTGGACA AGGAGCTCAG CGACTGCTTC GAGGCGCCCT TCTACACCCT GGGCCCCCTG GTCACGGACA TCGCGCCGGC CTACGACCAC ATCACCTCCG GCATCGGTGC CGCCAACATC GGCTGGTACG GCACCGCCAT GCTCTGCTAC GTGACCCCCA AGGAGCACCT GGGCTTGCCC AACAAGCAGG ACGTGCGCGA CGGCATCATC ACCTACAAGA TCGCCGCCCA TGCCGCCGAT CTCGCCAAGG GCTTCCCCGG CGCTCAGGTG CAGGACAATG CGCTTTCCAA GGCCCGCTTC GAGTTCCGCT GGGAGGACCA GTTCAACCTG GCCCTGGACC CGGAGCGGGC CCGGGAATTC CACGACGAGA CCCTGCCCAA GGAGGCCCAC AAGGTGGCGC ACTTCTGCTC CATGTGCGGG CCCAACTTCT GCTCCATGAA GATCACCCAG GACGTGCGTG ATTATGCCGT CAAGCAGGGC ATCTCCGAGC AGGAGGCCCT GGAGAAGGGC ATGCAGGAGA AGGCGGTGGA GTTCGTGAAC AAGGGCGCGC AGATCTACCG GGAGGTTTGA
|
Protein sequence | MSAIPQDFIA KTARLSEEIT RPFPNSRKIF VQGSSPDIRV PMREVSQAPT QASMGAEENP PITVYDTSGP YTDPAASIDL MQGLPALREA WIESRQDTER LDGPSSEYGR ARAADPQLAA LRFEHIRAPR RARPGLNVTQ MHYARRGIIT PEMEFVAIRE NARLEEMRAD PRYAKLLRQH AGQPLGAKIP ELITPEFVRD EVAAGRAIIP SNINHPELEP MIIGRNFRVK INTNIGNSAV TSSIEEEVEK LVWSCRWGGD TLMDLSTGKN IHETREWILR NSPVPIGTVP IYQALEKVNG KPEELTWELF RDTLIEQAEQ GVDYFTIHAG VLLRYVPLTA DRVTGIVSRG GSIMAKWCLA HHRENFLYTH FEEICEIMKA YDVAFSLGDG LRPGCLADAN DAAQFGELET LGELTKIAWE HDVQVMIEGP GHVPLHMVKE NVDKELSDCF EAPFYTLGPL VTDIAPAYDH ITSGIGAANI GWYGTAMLCY VTPKEHLGLP NKQDVRDGII TYKIAAHAAD LAKGFPGAQV QDNALSKARF EFRWEDQFNL ALDPERAREF HDETLPKEAH KVAHFCSMCG PNFCSMKITQ DVRDYAVKQG ISEQEALEKG MQEKAVEFVN KGAQIYREV
|
| |