Gene Clim_1669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1669 
Symbol 
ID6353976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1835038 
End bp1836720 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content55% 
IMG OID642669274 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001943690 
Protein GI189347161 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCT TTTCGGACAA TATCTTCTGC CCTGAGCACC GCATGTACGG CAACGCTTCG 
AAAAAAACCC ACACAAAGGG ATGCATCCAC CCCATAGAAG TCGGCATGAG AACCCTCAGC
CTCACGAAAA CCTACACCTG CAGGGGAATC GAATTTTCAT CGATCCCGCT CTACGACACA
AGCGGACCCT ATTCCGACCC ATCCGTAATA ATAAATCCTG AAAAAGGACT GAATCCGCTT
CGCGACAAGT GGAAGTTCAA CAGCGAAAAC ACGGAACCTG TTTCTGAACA AGGCAATGAA
ACTGCAGCTG CCAGAGTGCC GCTTCGGGCA AAAAAAGGGT GCTCCGTCAC GCAGCTCGCC
TTTGCGCGCA AGGGAATCAT CACTCCCGAA ATGGAGTATG TGGCTATTCG CGAAAACCAG
CAGCTCGAAG AATGGATCGC ATCATTCCCG ATCGGCGGAA AAACCGCAGA ACCCTTCACC
GCAGAATTCG TACGGCAGGA AGTTGCGGCC GGCAGGGCCA TCATTCCTGC CAACATTAAC
CACCCCGAAC TTGAGCCGAT GATTATCGGA AGAAATTTCA GGGTAAAGAT CAATGCGAAC
ATCGGCAACT CCGCCATGGG GTCCTCTATC GAAGAAGAGG TCGAAAAAGC CGTATGGGCA
TGCCGCTGGG GCGCCGATAC CGTGATGGAC CTCAGTACAG GAACCAACAT CCACCAGACC
AGGGAGTGGA TACTGCGTAA CTCACCCGTT CCTATCGGCA CGGTTCCAAT GTACCAGGCG
CTTGAAAAAG CCGGAGGCGT TGCAGAAAAC CTCACCTGGG AACTCTACCG CGATACGCTC
GTCGAACAGG CGGAACAGGG AGTCGATTAC TTCACCATCC ATGCCGGTAT TCTGCAGGAG
CATTTGCCGG CCGCGGGCCG GCGCATGACC GGTATCGTGT CGCGAGGAGG TGCAATCATG
GCCAAATGGT GCAAAACCAA TAACCGGGAA AATTTCCTGT ACACCCATTT CGACGAGATC
TGCGAAATCT TGAGAAGCTA CGACATCGCC ATTTCGCTCG GCGACGCTTT GCGACCCGGC
TGTATTGCAG ACGCAAACGA CGAGGCTCAG TTCGGTGAAC TGAAAGTGCT CGGCGAACTG
ACCCTCCTGG CATGGGAGCA CGACGTGCAG GTAATGATCG AGGGACCGGG CCATGTACCT
CTCAATCTCG TGGAAGAGAA CATGCGGAAA CAGCTCGAAC TTTGCCACGG AGCCCCGTTC
TACACGCTCG GTCCGCTTAT TACCGATATT GCTGCCGGTT ACGACCACAT CAATTCTGCT
ATCGGCGGCA CACTGATTGC CGCATACGGC TGTTCCATGC TCTGCTATGT CACCCCGAAA
GAGCATCTCG GCCTGCCAGA CAAGAACGAC GTGAGAGAAG GCGTTGTGGT ACACAAAGTA
GCCGCACACG CTGCCGATAT TGCGAAAGGA AACCCGACCG CATGGCTGCA GGACGAACTG
ATGAGTCGCG CCCGATACGC ATTTGCCTGG GAGGATCAGT TCAATCTTTC GCTCGATCCC
GTAAAAGCCA GGGTGCTGTA CGCCGAAAGC AGGGCCGCAA GCGGACAGAC CGACGGGAAT
CCGGACTTCT GTACCATGTG CGGCCCGGAT TTCTGCTCCA TGAAACGCTC GCAGGAAAAG
TGA
 
Protein sequence
MNTFSDNIFC PEHRMYGNAS KKTHTKGCIH PIEVGMRTLS LTKTYTCRGI EFSSIPLYDT 
SGPYSDPSVI INPEKGLNPL RDKWKFNSEN TEPVSEQGNE TAAARVPLRA KKGCSVTQLA
FARKGIITPE MEYVAIRENQ QLEEWIASFP IGGKTAEPFT AEFVRQEVAA GRAIIPANIN
HPELEPMIIG RNFRVKINAN IGNSAMGSSI EEEVEKAVWA CRWGADTVMD LSTGTNIHQT
REWILRNSPV PIGTVPMYQA LEKAGGVAEN LTWELYRDTL VEQAEQGVDY FTIHAGILQE
HLPAAGRRMT GIVSRGGAIM AKWCKTNNRE NFLYTHFDEI CEILRSYDIA ISLGDALRPG
CIADANDEAQ FGELKVLGEL TLLAWEHDVQ VMIEGPGHVP LNLVEENMRK QLELCHGAPF
YTLGPLITDI AAGYDHINSA IGGTLIAAYG CSMLCYVTPK EHLGLPDKND VREGVVVHKV
AAHAADIAKG NPTAWLQDEL MSRARYAFAW EDQFNLSLDP VKARVLYAES RAASGQTDGN
PDFCTMCGPD FCSMKRSQEK