Gene Arth_0822 details


Gene Information

Locus tag: Arth_0822
Symbol:
ID: 4446660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 888827
End bp: 890791
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 67%
IMG OID: 639688629
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_830320
Protein GI: 116669387
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3962] Acetolactate synthase
TIGRFAM ID:
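
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. Both are assumptions about how the record is laid out, not statements taken from it, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 888827
END_BP = 890791
GENE_LENGTH_BP = 1965
PROTEIN_LENGTH_AA = 654


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the reported fields.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span is 1965 bp, which is 655 codons, leaving 654 residues once the stop codon is set aside.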


Plasmid Coverage information

Number of covering plasmid clones: 14
Plasmid unclonability p-value: 0.837061
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGACGA GAACTGTAAC CGCAGGAACC CGCCGGATGA CGGTCGCCCA GGCCGTCGTC 
GAATATCTAT CCAAGCAGTA CACCGTTGAT TCTGTTGGCG GGCTGGATTA CCGCGAACGC
CTGATCCCCG GCACGTTCGG CATCTTCGGG CACGGCAACG TTGCCGGAGT GGGCCAGGCG
CTCAAGCAGT ACCAGCAGCT GGATCCGGCG ATCATGCCGT ACTACCAGGG CCGCAACGAG
CAGGCGCAGG CGCACCAGGC TGTGGGCTAC GCGCGCCACA CCCGCCGCCG GCAGACCTTT
GCGATCAGCA CCTCGATCGG GCCGGGTTCC TCCAACCTGC TCACGGGGGC GGCGCTGGCC
ACCACCAACC GCCTGCCGGT CCTGCTGCTT CCCAGCGACA CGTTCGCCAC CCGTGCTGCG
GATCCGGTGC TGCAGCAGCT CGAACAGCCC TACGCGTACG ACATCACCGT CAATGACGCC
TTCCGTCCGC TGTCCAAGTT CTTCGACCGC GTCAACCGGC CCGAGCAGCT GTTCTCCGCG
TTCCACCACG GACTGCGCGT ACTGACGGAT CCGGCCGAAA CCGGCGCTGT CACCATCTCC
CTGCCGCAGG ATGTCCAGGC CGAAGCCTTA GACGTGCCCG AGGAGTTCCT GGCCGAACGC
GAGTGGCGGA TCCGCCGCCC GGACGCGGAC GACGAGGACA TCCGCCGTGC CGCCGAAGCC
ATCCGCGCCG CGAAACGCCC GCTGATCATC GCCGGCGGCG GCGTCCTCTA CGCCTACGCC
AACGACGAAC TGGCCAGGTT CGTGGAACTC ACCGGCATCC CGGTGGGCAA CACCCAGGCC
GGGGTGGGCG TCCTGCCCTG GGACCACAAG TTGTCCCTCG GCGCGATCGG CTCCACGGGC
ACGACGGCGG CAAATGCCCT TGCAGCCGAA GCGGACCTGA TCATCGGCAT CGGCACCCGC
TACGAGGACT TCACCACCGC CTCGCGGACC GCGTTCCAGA ACCCGGATGT GAAGTTCATC
AACATCAACG TAGCCGCCAT GGATGCGTAT AAGCACGGCA CGTCGCTGCC GATCGTCGCG
GATGCGCGCA AGGCACTGGT GAAGCTGAAC CAAGCACTCG GCGGCTACCG CGTGGGCGCC
GACCTGGAAC AGCAGATCGC GGCCGAGAAG AAGCGCTGGG ATGCCACCGT GGACGAAGCG
TTCGACACCC GCTTCACTCC GCTGCCGGCC CAGAACGAAA TCATCGGCGC CACGTCCCGG
GCCATGGACG CCCAGGACGT CGTCGTCTGC GCCGCCGGGT CCCTGCCCGG TGATCTGCAC
AAGATGTGGC GCGTCCGGGA CCCCTTCGGC TACCACGTGG AATACGCCTA CTCCTGCATG
GGCTACGAAA TCCCGGGCGG GCTCGGCGTC AAGCGCGCTG CGCTGGCGGA AGCCGCGCGG
GGCGGGGCGG AGCGCGACGT CGTCGTGATG GTGGGGGACG GTTCGTACCT CATGATGCAC
ACCGAGCTGG TCACCGCCGT CGCCGAACGG ATCAAGCTGA TCGTGGTCCT GATCCAGAAC
CACGGCTACG CCTCGATCGG CTCGCTCTCC GAGTCCCTCG GCTCACAGCG CTTCGGCACC
AAGTACCGCG CCCTCGACGG AGACCACCAC AGCTTCGACG AAGGCGAAAC CCTTCCGGTG
GACCTCGCCC TGAACGCGCA AAGCCTGGGC GTGAAAGTGA TCCGGATCGA ACCGGGGGAG
AAGGTCATCG CCGAACTCGA ACAGGCCATC AGGGACGCCA AAGCCGCCCC CGAGCGCGGC
GGGCCCATCC TGATCCATGT CGAATCCGAC CCCCTGCTGG ACGCCCCGAG CTCCGAATCA
TGGTGGGACG TTCCCGTCTC CCAGGTCTCC GAGCTCGAAT CCACGAGGCA GGCATTCCAG
GCCTACACCG ACCACAAAAA CCGCCAGCGC AAGCTGCTCG GCTAA
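
The gene sequence is printed in space-separated blocks of ten bases, so any calculation on it first needs the whitespace removed. A minimal sketch of how a GC content figure like the one reported in the record might be computed follows. The helper names are hypothetical, and the exact counting and rounding rules used by the source database are not known from this page.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C, after whitespace is stripped."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above, pasted verbatim.
    first_line = (
        "ATGGGGACGA GAACTGTAAC CGCAGGAACC "
        "CGCCGGATGA CGGTCGCCCA GGCCGTCGTC"
    )
    print(f"{gc_fraction(first_line):.0%}")
```

Passing the whole gene sequence instead of a single line gives the figure to compare against the reported GC content.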
 
Protein sequence
MGTRTVTAGT RRMTVAQAVV EYLSKQYTVD SVGGLDYRER LIPGTFGIFG HGNVAGVGQA 
LKQYQQLDPA IMPYYQGRNE QAQAHQAVGY ARHTRRRQTF AISTSIGPGS SNLLTGAALA
TTNRLPVLLL PSDTFATRAA DPVLQQLEQP YAYDITVNDA FRPLSKFFDR VNRPEQLFSA
FHHGLRVLTD PAETGAVTIS LPQDVQAEAL DVPEEFLAER EWRIRRPDAD DEDIRRAAEA
IRAAKRPLII AGGGVLYAYA NDELARFVEL TGIPVGNTQA GVGVLPWDHK LSLGAIGSTG
TTAANALAAE ADLIIGIGTR YEDFTTASRT AFQNPDVKFI NINVAAMDAY KHGTSLPIVA
DARKALVKLN QALGGYRVGA DLEQQIAAEK KRWDATVDEA FDTRFTPLPA QNEIIGATSR
AMDAQDVVVC AAGSLPGDLH KMWRVRDPFG YHVEYAYSCM GYEIPGGLGV KRAALAEAAR
GGAERDVVVM VGDGSYLMMH TELVTAVAER IKLIVVLIQN HGYASIGSLS ESLGSQRFGT
KYRALDGDHH SFDEGETLPV DLALNAQSLG VKVIRIEPGE KVIAELEQAI RDAKAAPERG
GPILIHVESD PLLDAPSSES WWDVPVSQVS ELESTRQAFQ AYTDHKNRQR KLLG
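
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below shows one way to check that correspondence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits), and it stops at the first stop codon. The function and variable names are illustrative and not part of the source database.

```python
from itertools import product

# Codon table built in TCAG order; the amino-acid string follows the
# conventional layout for the standard genetic code ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGGGGACGA GAACTGTAAC CGCAGGAACC"))
```

For the first thirty bases this yields MGTRTVTAGT, which matches the first block of the protein sequence.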