Gene Arth_2487 details


Gene Information

Locus tag: Arth_2487
Symbol:
ID: 4445076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2781177
End bp: 2783057
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 66%
IMG OID: 639690302
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_831966
Protein GI: 116671033
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
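
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below is a minimal consistency check. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 2781177
end_bp = 2783057

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1881
print(protein_length)  # 626
```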


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.320621
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATACAC AAGAAACACA GCTGATCCCT GCCCAAAACG AGGCCGCCCC CGGCAATTCG 
GCACCTGCCG AAGCGCAGTC CCTGAAGTCG CACTCGCTGG CATTCATCAC CGATGAAGCC
ACCGGGATCC GGGTGCCGGT GACCGAAATC GCCCTTGAGG ACTCACCGGG CGGGGCAGCC
AACCCGCCGT TCCGCGTGTA CCGGACTGCC GGGCCCGGCA GCGATCCCGT GGTGGGCCTG
GAACCGTTCA GGACCCGGTG GATCGAGTCG CGGGCCGACA CCGAGCCTTA TGGCGGCCGG
GAACGGAACC TGCTCGACGA CGGCCGGTCG GCCGTGCGCC GCGGCGCCGC ATCCGCGGAG
TGGAAGGGCG CGCAGCCCGT GCCCCGCCGC GCCGTCGAAG GCAGGACTGT CACGCAGATG
CACTACGCCC GGCAGGGTGT CGTGACGCCG GAGATGCAGT TCGTTGCCCT CCGCGAAAAC
TGCGACGTGG AGCTGGTCCG CAGCGAAGTG GCTGCCGGCC GCGCCATCAT CCCCAACAAC
ATCAACCACC CGGAGTCCGA ACCGATGATC ATCGGCAAGG CCTTCCTGGT GAAGATCAAC
GCCAACATCG GCAACTCGGC CGTCACGAGC TCCATCGCGG AGGAGGTCGA CAAGCTGCAG
TGGGCCACAC TGTGGGGCGC CGACACCGTG ATGGACCTGT CCACCGGCGA TGACATCCAC
ACCACCCGTG AGTGGATCAT CCGCAACTCC CCCGTGCCGA TCGGCACCGT TCCCATCTAC
CAGGCACTGG AAAAGGTCAA CGGCGAGGCC AACAAACTGA CGTGGGAAAT TTTCCGCGAC
ACCGTGATCG AGCAGTGCGA GCAGGGCGTG GACTACATGA CCATCCACGC CGGCGTGCTG
CTGCGGTATG TGCCCCTGAC CGCCAACCGG GTGACCGGCA TCGTCTCCCG CGGCGGCTCC
ATCATGGCCG GCTGGTGCCT TGCCCACCAC CAGGAGAACT TCCTGTACAC GCACTTCGAC
GAGCTGTGCG AAATTTTCGC CAAGTACGAC GTCGCGTTTT CGCTGGGCGA CGGGCTGCGG
CCCGGTGCGA CGGCGGACGC GAACGACGCC GCCCAGTTCG CCGAGCTGGA TACCCTGGCC
GAACTGACGC AGCGCGCCTG GGAGTTCGAC GTGCAGGTGA TGGTGGAAGG ACCCGGCCAC
GTGCCGTTCC ACCTGGTCCG TGAAAACGTG GAACGCCAGC AGGAACTCTG CAAGGGAGCA
CCGTTCTACA CGCTGGGGCC GCTGGTCACG GACATAGCCC CGGGCTACGA CCACATCACC
TCCGCCATCG GCGCCACGGA AATCGCCCGC TACGGCACGG CCATGCTCTG CTACGTCACG
CCCAAGGAAC ACCTGGGGCT GCCAAACAAG GACGATGTCA AGACAGGCGT CATCACCTAC
AAGATCGCCG CCCACGCCGC CGACCTCGCC AAGGGCCACC CCGGCGCGCA CCAACGCGAC
GACGCCCTGT CCAAGGCCCG GTTCGAATTC CGCTGGCGGG ACCAGTTCGC CCTCTCGCTG
GACCCGGTCA CCGCCGAATC CTTCCATGAC GAGACGCTGC CCGCGGAGCC AGCCAAGACC
GCGCACTTCT GCTCCATGTG CGGGCCCAAG TTCTGCTCAA TGCGCATCAG CCAGGACATC
AGGGACGAGT ACGGTTCCGC CGAGGCACAG TCGGCACTCG CCGAGATGGC GGCAGGCATG
CGTGAAAAGA GCAACGAATT CCTCGCAGCC GGCGGCAAGG TCTACCTACC CGAGCTGCAG
CTTCCAGACC CGGAACGACC GGGCCGGCAC GGTGCAGCGA CGGGCGACGC TACGACGCCC
GTGAGTGCTG ACGCCTGCTG A
 
Protein sequence
MNTQETQLIP AQNEAAPGNS APAEAQSLKS HSLAFITDEA TGIRVPVTEI ALEDSPGGAA 
NPPFRVYRTA GPGSDPVVGL EPFRTRWIES RADTEPYGGR ERNLLDDGRS AVRRGAASAE
WKGAQPVPRR AVEGRTVTQM HYARQGVVTP EMQFVALREN CDVELVRSEV AAGRAIIPNN
INHPESEPMI IGKAFLVKIN ANIGNSAVTS SIAEEVDKLQ WATLWGADTV MDLSTGDDIH
TTREWIIRNS PVPIGTVPIY QALEKVNGEA NKLTWEIFRD TVIEQCEQGV DYMTIHAGVL
LRYVPLTANR VTGIVSRGGS IMAGWCLAHH QENFLYTHFD ELCEIFAKYD VAFSLGDGLR
PGATADANDA AQFAELDTLA ELTQRAWEFD VQVMVEGPGH VPFHLVRENV ERQQELCKGA
PFYTLGPLVT DIAPGYDHIT SAIGATEIAR YGTAMLCYVT PKEHLGLPNK DDVKTGVITY
KIAAHAADLA KGHPGAHQRD DALSKARFEF RWRDQFALSL DPVTAESFHD ETLPAEPAKT
AHFCSMCGPK FCSMRISQDI RDEYGSAEAQ SALAEMAAGM REKSNEFLAA GGKVYLPELQ
LPDPERPGRH GAATGDATTP VSADAC
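
The gene sequence starts with GTG while the protein starts with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where GTG can act as an initiator codon and is then read as methionine. The sketch below is a minimal, dependency-free illustration. The function name `translate_cds` and the compact codon-table encoding are illustrative choices, not part of the record, and the code assumes the input is an in-frame coding sequence that begins at the start codon.

```python
# Codon table in the usual NCBI layout: base order T, C, A, G for each position.
# Amino acid assignments for table 11 are the same as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Initiator codons for table 11; any of these as the first codon is read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is translated as methionine
        if amino_acid == "*":
            break  # stop codon: end of the protein
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGAATACAC AAGAAACACA GCTGATCCCT GCCCAAAACG AGGCCGCCCC CGGCAATTCG"
print(translate_cds(first_line))  # MNTQETQLIPAQNEAAPGNS
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence in the same way should reproduce the 626-residue protein, with the terminal TGA read as the stop codon.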