Gene Achl_2541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_2541 
Symbol 
ID7294016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp2859480 
End bp2861258 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content65% 
IMG OID643590950 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002488595 
Protein GI220913286 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0000236938 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCAGTCA GGCGCCTGAT GCAGTACATC ACCGCAGCGG CAGTGGTCGC AGTGGCCCTT 
TCTGGCTGCT CAGGCGGCGG CAGCGCGCCG GTAGTAGTGG GGGAGGCCAA GCGCGGAGGC
AGCGCCACCG TGGCCGAGGT CAACGCTTTT TCCTCCTTCA ACCCGTTCAG TACCGACGGC
AACACGGACA TCAACTCCAA GATCGGCGCA GCCACCCACT CTGGCTTCTA CTCCCTGGAC
GATAAATCCG TGGTGGTCCG GAACGACAAG TTCGGCCGCT ACGAAAAAGT CTCTGATGAC
CCGCTTAAGG TGCGGTACAC AGTCAACGAA GGCGTCAAAT GGTCCGACGG CGCGCCGATC
GACGCTGGCG ACCTCCTCCT GAGCTGGGCC GCGGGTTCGG GTTATTTCGA CGACGCGGAT
CCTGCGGCCG GGACGGGCAC CAGGTATTTT TCGGCTGCGT CAGCCGCCGG CGGCCTCGCG
GGCACGGCGT TCCCCGAGCT CGGCGACGAC GGCCGGTCCA TCACGCTGCA GTACGCCGCG
CCTTACGCGG ACTGGCAGAC CGCGTTCGAC GTCGGCCTGC CAGCCCATGT GGTCGCAGCC
AAGGCCGGGC TGAGCGACGA GGAGGACCTC GTGGACCTGA TCAAGGACGC GCCCAAGGGA
AACCCCGGAA AACCGGCGGT AAATTCGGCG TTGAAGACGG TGGGCGATTT CTGGAACAAC
GGATTCGATA CGAAATCCCT GCCCGACGAC CCCGCCCTGT ATCTTTCCAG CGGACCCTAC
ATCGTGCGGG ACATCGTTCC GGAAGTATCC ATGAAACTCG TCCGGAACCG GGACTACGTG
TGGGGGACCG AGCCGTGGCT TGACGAGATC AACGTCCGGT TCACCGGTGC CCTTCCTACC
GCTGTTGATG CGCTCCGCAG CGGGCAGGCG GACATCATCT CGCCGCAGCC TTCCGCCGAC
ACCGCGAACC TCTTCGCCGG CCTGGCGGAC CAGGGAAACA CGGTGGAGCA GTACAGCCAG
TCGGGGTACG ACCACCTCGA CCTCAACTTC TCCGGGCCCT TCGCGGACGA GGACGTCCGC
AAGGCCTTCC TGAAGGCCGT GCCCCGGCAG GCCATCGTGG ACGCGGTGGT GGGGGGCCTG
ATTACGGACG CCAAACCGCT CGATTCGCAG GTCTTCCTTC CGGGCCAGCC CAAGTACGCG
GATACTGTGA AAAACAACGG CTCGGCCGAA TACGCCGAGG TGGACATCGA CGCAGCCAAG
GAACTCCTGG ACGGTGCCAC GCCGACCATC CGCATCCTGT ACAACCGGGA CAACCCCAAC
CGCGCCAAGG CATTCACCCT GATCCGCGAT TCGGCGCAGA AGGCCGGTTT CCGAGTGGTC
GATGCCGGCC AGGGAAATGC GGACTGGGCC AAGTCGCTCG GGGGCGCAGG GTACGACGCC
GCTTTGCTGG GGTGGATCGG AACGGGCGCC GGAGTGGGCC GCATCCCGCA GATCTTCCGC
ACCGGGGCGG GCAGCAACTT CAACGGATTC TCCGACGGCG ACGCGGACAA GGCAATGGAG
CAGCTGGCAA CCACCACTGA CCTCGGCAAA CAGGACGAAC TGCTGGCGGG GATCGATAAG
CGCGTCTGGG AGAAAGCGTA CGGACTGCCG CTTTACCAGA CGGTCGGAGC CATAGCCTTC
AACGCCCGGG TGACCGGTGT GAAACCCAGC CCGGGACCCC TCGGCGTGTG GTGGAACGTC
TCGGATTGGC GCCTTGCCGA GCAGGGGACC AAGAACTGA
 
Protein sequence
MPVRRLMQYI TAAAVVAVAL SGCSGGGSAP VVVGEAKRGG SATVAEVNAF SSFNPFSTDG 
NTDINSKIGA ATHSGFYSLD DKSVVVRNDK FGRYEKVSDD PLKVRYTVNE GVKWSDGAPI
DAGDLLLSWA AGSGYFDDAD PAAGTGTRYF SAASAAGGLA GTAFPELGDD GRSITLQYAA
PYADWQTAFD VGLPAHVVAA KAGLSDEEDL VDLIKDAPKG NPGKPAVNSA LKTVGDFWNN
GFDTKSLPDD PALYLSSGPY IVRDIVPEVS MKLVRNRDYV WGTEPWLDEI NVRFTGALPT
AVDALRSGQA DIISPQPSAD TANLFAGLAD QGNTVEQYSQ SGYDHLDLNF SGPFADEDVR
KAFLKAVPRQ AIVDAVVGGL ITDAKPLDSQ VFLPGQPKYA DTVKNNGSAE YAEVDIDAAK
ELLDGATPTI RILYNRDNPN RAKAFTLIRD SAQKAGFRVV DAGQGNADWA KSLGGAGYDA
ALLGWIGTGA GVGRIPQIFR TGAGSNFNGF SDGDADKAME QLATTTDLGK QDELLAGIDK
RVWEKAYGLP LYQTVGAIAF NARVTGVKPS PGPLGVWWNV SDWRLAEQGT KN