Gene Achl_2091 details

Gene Information

Locus tag: Achl_2091
Symbol:
ID: 7293552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 2355566
End bp: 2357116
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 63%
IMG OID: 643590490
Product: band 7 protein
Protein accession: YP_002488149
Protein GI: 220912840
COG category: [S] Function unknown
COG ID: [COG2268] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
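
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 2355566
end_bp = 2357116
gene_length_bp = 1551
protein_length_aa = 516

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp. Assumption: the last codon is a stop
# codon, which is not translated into an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)
print(remainder == 0)
print(expected_protein_length == protein_length_aa)
```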


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0000000648722
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTATTG GGGGAACCAT GGTGATCGTC CTGATCGCCG CAGCTGCAGT GGTGGTGCTG 
CTCATCGCCG GATTCATTTT CTATACCAAG AGCATCCGGT TCGCCAAGCC CAACGAGGCG
ATGCTGATTA CCGGCAAGAG CGATCCGAAC ACAACGAACG AAACTTCCGA TGACCAGTCG
CGGGTCATCA TCAACAACAG GGCATTCGTC AATCCCATCA CCGAGCGCGT CAGCCACATC
TCGCTGTCCT CGCGCCAGGT TGAGGTGACC ATCGAGGCCA TTTCGAACAA CGGCATCCAG
CTCAAGCTGA CTGGTGTTGC CCAGGTCAAG GTCGGGGGCG ACAAGGTTTC GGTCCGCAAG
GCCGCCCAGC GGTTCCTGGA CCAGCAGGAC GCCATTGACC ACTACACGCA GGAAACCCTG
TCTGGCTCCC TGCGCTCGAT CGTCGGCACG CTGAGCGTTG ATGCGATCAT CAAGGACCGC
GCGCAGTTCG CGGCCTCCGT CAAGGAGGAG GCCGAACACT CCATGACCAA CCAGGGCCTG
GTGATCGACA CCTTCCAGAT CAAGTCCGTG GACGATACCG GCGGCTACCT GAAGAACCTG
GGGCGTCCCG AAGCTGCGCT GGTGGCCCGG AACGCGAGCA TCGCCGAGGC CAATTCGCAG
CGCGAGGCCG CGGAGGCGAA AGCCCTTGCG GACCAGAAGA CGGCTGAGGC GGAGCAGAAG
CTGGCGCTCC GCCGCGCCGA ACTCAAGCAG GAAACCGATG CCCGCCAGGC TGAGGCCGAT
GCTGCCGGGC CGCTGGCCCA GGCTGACCAG CAGGAAGCGA TCATCCTGAA GAACCAGCAG
GTGGTGGCAC GCCAGGCGGA ACTCCGTGAA AAGGAACTCG ACATCGAGGT CCGCAAGCCC
GCCGATGCCG CCAAGTACAA GGTGGAAACG GAAGCTGCAG CGGATGTGTC CCGCCGTACC
CGGATCTCCG AAGCAACCAA GGTGGAGGCA GCCGCCGAAC TGGAAACCAG GAAACTGCGG
GCTGCCGGCA ACGAGGTGGA AGCCCAGGCG CTGGCTGCCG CCAACACGGC CAAGGGAAAC
GCGGAAACGG AGATCAACAA GATCCGCGGC CTGGCCGAGG CGGAAGTCAC CAAGTCGCAG
GGTATTGCCG AAGCGGATGT CATCGGACTG CGGGGCAAGG CCGAAGCGGA AGCCATCGAA
GCCCAGGCCA AGGCGTACAG CGAGTTCAAC GAGGCAGCCA TCCTGAACAA GCTGCTGGAA
GTCCTGCCGT CCATCGCGAA GGAAATCGCG GCCCCGATGG GTGCCATCAG CAACATGACG
GTCATCTCGA ACGACGGCGC CGGACAGGTG AGCAGGAACG TCTCCTCGGG CGTGCACGAA
ACAGCCCAGC TCCTCAAGGA CACCACCGGC TTTGACGTCA TCCAGATGCT GAAGGGCTTC
GGCCAGACGT CGGCAACGCC AACAGCCGGC ACGTCCGCGA CGTCCGTTGG TTCAGCAGGC
AACGGCAAGT CGCCGGAGCA GGCCACACCG CAGGGCCCCG GGCAGGACTA G
 
Protein sequence
MIIGGTMVIV LIAAAAVVVL LIAGFIFYTK SIRFAKPNEA MLITGKSDPN TTNETSDDQS 
RVIINNRAFV NPITERVSHI SLSSRQVEVT IEAISNNGIQ LKLTGVAQVK VGGDKVSVRK
AAQRFLDQQD AIDHYTQETL SGSLRSIVGT LSVDAIIKDR AQFAASVKEE AEHSMTNQGL
VIDTFQIKSV DDTGGYLKNL GRPEAALVAR NASIAEANSQ REAAEAKALA DQKTAEAEQK
LALRRAELKQ ETDARQAEAD AAGPLAQADQ QEAIILKNQQ VVARQAELRE KELDIEVRKP
ADAAKYKVET EAAADVSRRT RISEATKVEA AAELETRKLR AAGNEVEAQA LAAANTAKGN
AETEINKIRG LAEAEVTKSQ GIAEADVIGL RGKAEAEAIE AQAKAYSEFN EAAILNKLLE
VLPSIAKEIA APMGAISNMT VISNDGAGQV SRNVSSGVHE TAQLLKDTTG FDVIQMLKGF
GQTSATPTAG TSATSVGSAG NGKSPEQATP QGPGQD
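
The sequences above are printed in space-separated blocks of 10 residues, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC fraction, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the sequence fragment is taken from the record. Translation table 11 assigns the same amino acids to codons as the standard genetic code, so a standard codon table is used here.

```python
# Hypothetical helper names; only the sequence fragment comes from the record.
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove the spaces and line breaks used to group residues."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGATTATTG GGGGAACCAT GGTGATCGTC CTGATCGCCG CAGCTGCAGT GGTGGTGCTG"
dna = clean(first_line)
print(len(dna), translate(dna))  # 60 MIIGGTMVIVLIAAAAVVVL
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would allow the reported gene length, protein length, and GC content to be checked in the same way.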