Gene Achl_2227 details


Gene Information

Locus tag: Achl_2227
Symbol:
ID: 7293695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 2496367
End bp: 2498184
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 65%
IMG OID: 643590629
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_002488281
Protein GI: 220912972
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
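
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, which is consistent with the listed 1818 bp, and that the final codon is an untranslated stop. The function names `cds_length` and `protein_length` are hypothetical and not part of the source database.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a complete CDS whose last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


gene_bp = cds_length(2496367, 2498184)
print(gene_bp)                  # 1818
print(protein_length(gene_bp))  # 605
```

Both results agree with the Gene Length and Protein Length fields of this record.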


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.000000000308788
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGAATACAC CCAATGCAGT GCTGCCCCCT GCCCAAAACC ACCCTGCCGG AGCCACGCCT 
GAAGCACCGG TCACCCAGTC CCTGAAGTCC CACTCGCTGG CCTACATCGA TGATCCGCAG
CACGGCATCC GTGTCCCGGT AACGGAGATC GCGCTGGAGC CCTCGCCCAA CGGCCAGCCG
AACGCCCCTT TCCGCACGTA CCGGACAGCG GGCCCGGGAA GCGACCCCGT CCGGGGCCTC
AGCCCTTTCC GGTCGGGATG GATTGAAGGG CGGGATGACA CGGAAGAGTA CAGCGGAAGG
GCACGGAACC TGCTCGACGA CGGCCGCTCG GCTGTGCGCC GCGGCGCCGC CTCTGCGGAA
TGGAAAGGCG GGCGCCCGGT GCCCCGCCGC GCCGTCGACG GCCGGACAGT CACCCAGATG
CACTACGCCC GGAAGGGCGT GGTGACGCCG GAAATGCGGT TCGTGGCGCT GCGGGAAAAC
TGTGATCCGG AACTGGTCCG GAGTGAGGTT GCGGCTGGAC GGGCCATTAT CCCCGCCAAT
ATCAACCATC CGGAGTCCGA ACCGATGATC ATCGGCAAGG CTTTCCTGGT GAAGATCAAC
GCCAACATCG GCAACTCCGC CGTCACCAGT TCCATCAGGG AGGAGGTGGA CAAGCTGCAG
TGGGCCACCC GGTGGGGTGC CGACACGGTG ATGGACCTTT CCACGGGCGA CGACATCCAC
ACCACCAGGG AATGGCTCAT CCGCAATTCC CCCGTGCCGA TCGGCACCGT GCCCATCTAC
CAGGCCCTGG AAAAGGTCAA CGGCGAGGCG AACGCACTGA CGTGGGAAAT CTTCCGCGAC
ACCGTCATAG AACAATGTGA ACAGGGCGTG GACTATATGA CGGTCCACGC CGGCGTCCTG
CTCCGCTACG TGCCGCTGAC CGCCAACAGG GTCACCGGCA TCGTGTCCCG GGGCGGGTCC
ATCATGGCGG GATGGTGCCT GGCGCACCAC CAGGAGAACT TCCTGTACAC GCATTTCGAT
GAGCTGTGTG AGATCTTCGC CAAGTACGAT GTCGCGTTCT CGCTGGGTGA CGGCCTGCGC
CCCGGTGCCA CCGCCGATGC CAACGACGCT GCGCAGTTCG CGGAACTCGA CACGCTGGCT
GAGCTTACGG ACCGCGCCTG GAAACATGAC GTGCAGGTCA TGGTGGAAGG GCCGGGCCAC
GTCCCGTTTC ATCTGGTGCG GGAGAACGTG GAGCGCCAGC AGCAACTGTG CAAGGGCGCG
CCGTTCTATA CGCTGGGGCC GCTGGTCACC GATGTGGCCC CAGGCTACGA CCACATCACC
TCAGCCATCG GCGCCACCGA GATCGCGCGC TACGGCACCG CCATGCTCTG CTACGTCACA
CCCAAGGAGC ATCTGGGACT GCCCGACAGG GACGATGTGA AGACCGGAGT CATCACCTAC
AAAATCGCCG CGCACGCCGC CGACCTGGCC AAGGGCCACC CCGGGGCGCA CGAACGGGAC
GACGCCCTGT CCAAGGCCAG GTTCGAGTTC CGCTGGCGGG ACCAGTTTGC CCTTTCACTT
GACCCCGAGA CGGCCGAAGC CTTCCATGAT GAAACCCTTC CGGCCGAGCC CGCCAAGACG
GCACACTTCT GCTCCATGTG CGGGCCGAAG TTCTGCTCGA TGCGGATCAG CCAGGACATC
CGCAATGAGT ACGGATCCGC GGAAGCCCAG GCCGCGATAG CCGAAGCGGC ATCCGGGATG
CGGGAGAAGA GCCAGGAGTT CCTGGAATCC GGCGGCAAGG TGTACCTTCC CGAGCTGAAA
GTCCCGGCCG GCAGCTAA
 
Protein sequence
MNTPNAVLPP AQNHPAGATP EAPVTQSLKS HSLAYIDDPQ HGIRVPVTEI ALEPSPNGQP 
NAPFRTYRTA GPGSDPVRGL SPFRSGWIEG RDDTEEYSGR ARNLLDDGRS AVRRGAASAE
WKGGRPVPRR AVDGRTVTQM HYARKGVVTP EMRFVALREN CDPELVRSEV AAGRAIIPAN
INHPESEPMI IGKAFLVKIN ANIGNSAVTS SIREEVDKLQ WATRWGADTV MDLSTGDDIH
TTREWLIRNS PVPIGTVPIY QALEKVNGEA NALTWEIFRD TVIEQCEQGV DYMTVHAGVL
LRYVPLTANR VTGIVSRGGS IMAGWCLAHH QENFLYTHFD ELCEIFAKYD VAFSLGDGLR
PGATADANDA AQFAELDTLA ELTDRAWKHD VQVMVEGPGH VPFHLVRENV ERQQQLCKGA
PFYTLGPLVT DVAPGYDHIT SAIGATEIAR YGTAMLCYVT PKEHLGLPDR DDVKTGVITY
KIAAHAADLA KGHPGAHERD DALSKARFEF RWRDQFALSL DPETAEAFHD ETLPAEPAKT
AHFCSMCGPK FCSMRISQDI RNEYGSAEAQ AAIAEAASGM REKSQEFLES GGKVYLPELK
VPAGS
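
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial code). The following is a minimal sketch of that translation and of a GC-content calculation, not the database's own pipeline. It assumes the standard codon assignments, which table 11 shares with the standard code, and treats an alternative start codon such as the leading TTG as methionine. The helper names `translate_cds` and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading any table 11 start codon as M."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 nt of the gene sequence above.
print(translate_cds("TTGAATACAC CCAATGCAGT GCTGCCCCCT"))  # MNTPNAVLPP
```

Passing the full gene sequence to `translate_cds` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the GC content field of this record.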