Gene Information |
Locus tag | Cphamn1_0909 |
Symbol | |
ID | 6374576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 983175 |
End bp | 984866 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642683411 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001959335 |
Protein GI | 189499865 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
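
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop and is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 983175, End bp 984866.
length = gene_length_bp(983175, 984866)
print(length)                     # 1692, matching "Gene Length"
print(protein_length_aa(length))  # 563, matching "Protein Length"
```

Because the gene lies on the minus strand, Start bp and End bp give the extent on the replicon; the coding sequence reads from End bp toward Start bp, which does not affect the length calculation.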
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.462945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACTT CACCGGATAA CCTCTTTTGT CCTGAACAGA ACTTTTACGG ACCGGATTCA GAAAAAATCT ATATCGATGG CTCCCTTCAC CCCGTCAAAG TGGGTATGCG GAGAATCAAG CTGTCAAAAA CCTATACACT GCATGGCACT GATTTTTCAT CTTTCCCTCT CTATGATACC AGCGGCCCGT ATTCTGATCC GTCTGTAACC ATCGACCTGC ACAAAGGACT CCCCTCAACT CGGGACTTCT GGCAAAAGAA CAGGACGGAT ATTGAAGTCT GTCCCGGCAA AAATCCCTCG CCAATGAACA ACAGAACTCC TGTCAGGGCA AAACAGGGGA AATCTGTTAC ACAGATGCAT TACGCCAGGA AAGGCATCAT TACTCCCGAA ATGGAATATG TCGCCATCAG GGAAAACCAG CAGCTCGAGG AGTGGATTGA AAGGTTTTCA TCAAACGGCA GTTCTGTGAA GCCGGTTACG CCGGAATTTG TTCGCGACGA GATCGCAAAA GGAAGAGCGA TTATTCCGGC AAACATCAAC CATCCTGAAC TGGAACCGAT GGCTATCGGG AGAAACTTCC GGGTCAAGAT AAACGCGAAC ATCGGAAATT CTGCCCTTGC ATCATCTATC AGTGAAGAGG TTGAAAAATC TGTCTGGGCA TGCCGATGGG GAGCGGACAC CGTGATGGAC CTGAGTACAG GAAAAAATAT CCACCAGACG CGGGAGTGGA TTCTCCGGAA CTCCCCTGTT CCCATAGGCA CAGTGCCGAT ATATCAGGCA CTTGAAAAAG TTGGAGGTAA AGCCGAAGAG CTGAACTGGA ACATCTACCG TGATACGCTT ATCGAACAGG CCGAACAGGG GGTTGATTAT TTCACCATTC ACTCCGGCAT TCTTCTTGAT TTTCTTCCTG CCGCACAACG AAGAACCACC GGCATCGTCT CGCGCGGGGG ATCGATTATC GCCAAATGGT GCCGGGCGCA TAAACAGGAA AACTTTCTTT ACTCCCACTT TGATGATATC TGTGACATAC TCAGATCGTA TGATATCGCG ATCTCCATCG GAGATGCCCT TCGTCCCGGA TCAATTGCAG ACGCCAATGA CGAGGCTCAG TTCAGTGAAC TGAAAACTCT CGGCGAGCTG ACCCTGAAAG CCTGGAAATA CGATGTTCAG GTAATGATAG AAGGCCCCGG TCATGTCCCG CTCAATCTTG TTGAGGAGAA CATGCGAAAG CAGCTCGAAT ACTGCCATGA AGCCCCGTTC TACACGCTGG GGCCACTGGT TACCGACATA GCCGCCGGGT ATGACCATGT CAACTCCGCG ATCGGAGGGA CACTGCTGGC AAGCCTTGGC TGCGCGATGC TCTGTTATGT CACCCCCAAG GAACATCTGG GCCTTCCTGA TAAAAATGAT GTACGGGAGG GGGTTATCGT GCATAAACTT GCGGCCCATG CCGCCGACAT TGCCAAAGGC AGTCCATCAG CCCTTCTGCA TGACAAACTG ATGAGCAGTG CCCGATACTC ATTCGCATGG AACGACCAGT TCAATCTCTC TCTTGACCCG GTCAAAACCA GACAGGTTCA TGCGGAGAGT TCACAGCAAA ATACAGGGGA TGGCACCGAC GACCACTTCT GCACCATGTG CGGACCTGAC TTCTGTTCAA TGAAGAAGTC GCAGGAAGTT ACGGGGAAAT AG
|
Protein sequence | MSTSPDNLFC PEQNFYGPDS EKIYIDGSLH PVKVGMRRIK LSKTYTLHGT DFSSFPLYDT SGPYSDPSVT IDLHKGLPST RDFWQKNRTD IEVCPGKNPS PMNNRTPVRA KQGKSVTQMH YARKGIITPE MEYVAIRENQ QLEEWIERFS SNGSSVKPVT PEFVRDEIAK GRAIIPANIN HPELEPMAIG RNFRVKINAN IGNSALASSI SEEVEKSVWA CRWGADTVMD LSTGKNIHQT REWILRNSPV PIGTVPIYQA LEKVGGKAEE LNWNIYRDTL IEQAEQGVDY FTIHSGILLD FLPAAQRRTT GIVSRGGSII AKWCRAHKQE NFLYSHFDDI CDILRSYDIA ISIGDALRPG SIADANDEAQ FSELKTLGEL TLKAWKYDVQ VMIEGPGHVP LNLVEENMRK QLEYCHEAPF YTLGPLVTDI AAGYDHVNSA IGGTLLASLG CAMLCYVTPK EHLGLPDKND VREGVIVHKL AAHAADIAKG SPSALLHDKL MSSARYSFAW NDQFNLSLDP VKTRQVHAES SQQNTGDGTD DHFCTMCGPD FCSMKKSQEV TGK
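
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the sequences are pasted in as shown here, in space-separated blocks of ten characters, and it uses only the first 30 bases of the gene sequence as input. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, so this sketch does not model alternative start codons.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGAGTACTT CACCGGATAA CCTCTTTTGT"
print(translate(prefix))  # MSTSPDNLFC, the first block of the protein sequence
print(gc_fraction(prefix))
```

Running the same functions on the full gene sequence is one way to cross-check the GC content and protein sequence fields; the reported 51% is presumably a rounded value.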