Gene Achl_2025 details


Gene Information

Locus tag: Achl_2025
Symbol:
ID: 7293486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 2283642
End bp: 2285207
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 67%
IMG OID: 643590429
Product: protein of unknown function DUF349
Protein accession: YP_002488088
Protein GI: 220912779
COG category:
COG ID:
TIGRFAM ID:
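
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2283642
END_BP = 2285207
GENE_LENGTH_BP = 1566
PROTEIN_LENGTH_AA = 521


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2285207 - 2283642 + 1 gives 1566 bp, and 1566 / 3 - 1 gives 521 aa, matching the listed gene and protein lengths.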


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.000000000837835
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACAGACA GTCAGAAATC CGACGAAACA GCAGCAGACG TGACCGACGC GGCGGCTCCC 
GCCACGGACG CAGAACCTGC AGCAGCCAGC GCAACGGCCC CGGAGGCCGT TACCGAAGGT
GTGGAAGAAG CCGGCACGGA GGCAGACCAG CCGGCCGAGT CCCCTGCCCC GGACGCAGAA
CCGGCAGCCC CTGCCCCGTC TCCGGCTCCC GGACCCCGGC CTTCGGCCCC GTCCCCCGCA
GCCTTCGCGT CACGTCCCAA GCCGGCTGCT GCTGCCCCAG CAGCTGCCGT GGCACCCGCC
GCCCCCGCCA CCTCGCCTGC TGAGGCAGCA AAATGGGGAC GCGTGGAAGG CGATGGACAC
GTCTTCCTGA CGATTGACGG CGGAGAACAC GCCGTCGGGC AGTACCCGGG CGTCAGCGAT
GACGAAGCTC TGGCTTACTT CGCGCGGAAG TACGACGACG TCGTGGCCCA GATCGTGCTC
CTGGAACAGC GCGTGAGCTC CAAGGCCCCC AGCACCGACA TGCAAAAGAC CGTCACCCAC
CTGCGCGAAC AGCTTGCGGA GCGCAACATG GTGGGCGACC TCCGGGCAGC CGAAGCCCGA
TTGGATACCC TGGCAACCCA CATCGCCGAG CTTGAGAAGA CGGAAAAGGC AGAGCACGAC
GCCGTCCGGG CAGCCGAACT TGCCGCCCGC GAAGCCATCG TGGCCGAGGC GGAAGAGATT
TCCGGGCAGG ACCCGGCGCA GACCCAGTGG AAGACCTCCA GCGCACGGAT GAACGAGCTG
TTCGAAAACT GGAAGGCTGC GCAGAAGAGC GGCGTCCGGC TGGGCCGCAG CAACGAAGAC
GCCCTCTGGA AGCGCTTCCG CGCTGCCCGT ACCGTTTTTG ACCGGCACCG CCGGGCCTAC
TTCTCCCAGC TGGACAGCAA CAATTCCGCT GCGAAGACCG CCAAGGAAAA GCTCATCTCC
GAGGCAGAGG CCCTCTCCAG CTCCACTGAC TGGGGCTACG CCGCAGGCGA ATACCGCCGC
CTCATGGACC AGTGGAAGGC CTCTCCCCGC GCCAGCCGCA AGGATGATGA CGCGCTGTGG
GCACGCTTCC GTGCCGCGCA GGACGTCTTT TTCACGTCCC GGCAGGCCGC CAACGACGAG
ATCGACCAGG AGTACGCCGC CAACCTGACG GTCAAGGAAG CCCTCCTGGC CGAGGCCAAC
ACGATTCTCC CGATCAAAGA ACTGGCCAGT GCCAAGAAGG CGCTGCAGTC CGTCCGTGAC
CGGTGGGAAG AAGCCGGAAA GGTCCCGCGC GCCGACATGG GCCGGATCGA AGCCGGCCTC
CGCAAGGTCG AGGACGCCGT CCGGCAGGCC GAAGAAGAGC AGTGGCAGCG TTCAAACCCC
GAACGCAAGG CGCGCACCAA CAGTGCCCTT TCACAGCTCG AGAGTGCCAT CGCCGGCCTG
CAGGATGACC TCGCCAAGGC TGAGCAGAGC GGTGACGAGC GCAAGATCAA GGCAGCCAGG
GAAGCCCTCG AGGCACGCCA GGCCTGGCTC GACCAGATCC AGCGTTCCGC CAGCGAACTG
AGCTGA
 
Protein sequence
MTDSQKSDET AADVTDAAAP ATDAEPAAAS ATAPEAVTEG VEEAGTEADQ PAESPAPDAE 
PAAPAPSPAP GPRPSAPSPA AFASRPKPAA AAPAAAVAPA APATSPAEAA KWGRVEGDGH
VFLTIDGGEH AVGQYPGVSD DEALAYFARK YDDVVAQIVL LEQRVSSKAP STDMQKTVTH
LREQLAERNM VGDLRAAEAR LDTLATHIAE LEKTEKAEHD AVRAAELAAR EAIVAEAEEI
SGQDPAQTQW KTSSARMNEL FENWKAAQKS GVRLGRSNED ALWKRFRAAR TVFDRHRRAY
FSQLDSNNSA AKTAKEKLIS EAEALSSSTD WGYAAGEYRR LMDQWKASPR ASRKDDDALW
ARFRAAQDVF FTSRQAANDE IDQEYAANLT VKEALLAEAN TILPIKELAS AKKALQSVRD
RWEEAGKVPR ADMGRIEAGL RKVEDAVRQA EEEQWQRSNP ERKARTNSAL SQLESAIAGL
QDDLAKAEQS GDERKIKAAR EALEARQAWL DQIQRSASEL S
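
The sequences above are printed in 10-character blocks separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, along with a GC-fraction calculation and a start-codon check. All function names are hypothetical, and the start-codon set is only a subset of the initiation codons permitted by translation table 11 (which is why a gene beginning with GTG can yield a protein beginning with M).

```python
# Subset of initiation codons allowed by translation table 11 (assumption:
# only the most common ones are listed here, not the full set).
COMMON_TABLE11_STARTS = {"ATG", "GTG", "TTG"}


def normalize(blocked: str) -> str:
    """Join a space/newline-blocked sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def has_common_start(seq: str) -> bool:
    """True if the first codon is one of the common table 11 start codons."""
    return seq[:3] in COMMON_TABLE11_STARTS


if __name__ == "__main__":
    # First line of the gene sequence from this record, as printed.
    first_line = (
        "GTGACAGACA GTCAGAAATC CGACGAAACA "
        "GCAGCAGACG TGACCGACGC GGCGGCTCCC"
    )
    seq = normalize(first_line)
    print(len(seq), has_common_start(seq))
    print(round(gc_fraction(seq), 3))
```

To compare against the 67% GC content reported in the record, the full gene sequence would be passed to `normalize` and then `gc_fraction`; the snippet above uses only the first line to stay short.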