Gene Achl_1779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_1779 
Symbol 
ID7293239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp2011721 
End bp2013364 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content66% 
IMG OID643590187 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002487847 
Protein GI220912538 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000000779401 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCCCCA TCGAATTCCG CCGCCGCGCC TTCCTGGCCG GCATCACCGC CATCACCGGT 
TCCGCCGTCC TCACCGCATG CGGCGGCCCC TCGGCCACCA GTTCCTCCGA CGCCGGCACG
CCGGTTGACG GCGGAAACAT CACCTTCCTC ATCCAGGGCT ACGACACCGG CTGGGTCTCG
AGCAAGACGT CCATTTCCAG CTACGAGGGC AACCTCTGGG GCCAGATCAC CGACAAGCTG
GTCTACGTAG ACGACAAAGG CCAGCTCAGC CCCTGGGTGG CCGAAAGCTG GGAGGAACTC
AACGGCGCCA AGGATTTCGT CCTGCACCTC AAAGACGGAG TGACTTTCTC CGACGGCACG
CCCCTGGACG CGGCCGCCGT CGTGGCCAAC CTCAACGCTT GGGCCAAGGG GGCCCCGGAC
CGCGGCGTCA GCAAGGTGGG CCTCTTCCCG TCCAGCAACT TCGCCAGCGC GGAGGCCGCC
GACGCCAGGA CGGTCAAGGT GTCCTTCTCC TCCCCCGCGC TCGGCTTCAT CGCCACGCTG
GCGTACCACG GCTGCATCCT CCTGTCCCCC AAGACCCTCG CTCTGCCGGT GGACGCCCAG
GCGGACCTGG CGCAGGAAAT CGGCAGCGGC CCGTTCATCC TCAAGTCCTG GAAGCAGGGC
GACTCGTACG TGCTCGAGAA GCGCAAGGAC TACAACTGGG GGCCGGCCGC CCTGGGCCAC
ACGGGCCCGG CCCGCCTGGA CACCATCACC TACAAGGTCA TCAAGGACAC CTCGGTGCGG
ACCTCCACGG TGGCGTCCGG CCAGGCGGAA GTTGCCTTCA ACGTGGAGCC GCAGGAAATC
GACTCCCTCA AAGCGCAGGG CTTCACCGTG GGAACACCCA AGTACCTGGG CTTCGTGGAC
GGCTTCCAGG TCAACACCCA GGCCTTCCCC ACCAACGATC CCAGCGTCCG CCAGGCCATC
CAGCACGGCA TCGACCGTGA GGAGATCCGG AACACCGTCT ACACGGAGGA CTGGGATGCG
GCCACCACGT TCATCCAGGG CAACGTCCCG GAGGCCGGCG ACTACAGCAG TGCCTTCGCC
TTCGATGCGG ACAAGGCGAA GAAGCTGCTG GACGACGCCG GCTGGAAGCC CGGTCCCGAC
GGGTTCCGCG TGAAGGACGG CAAGGTCCTC GAGTTCCCGC TCACGCCCAA CCCCTACGTT
CCCTCCACCA AGGCCGAGGA CGAGCTCATC GCCCAGCAGC TGGAGCGCAT CGGCATCAAG
GTCAACCTCA AGGTGGTGGA CGTGGCCGGT TACGCCGCCA TCCAGGCCAG CCGTCCGCCG
CTGTTCCAGA CTTCCCGCAG CTTCGTGGAC GTGGGAACGG TGGCCGGCGT GCTGACCAGC
CAGAACAACG GCGAAAACTG GTTCAACCTG GGCACCAGTG ACCAGAAGCT CAACGATCTG
TCCACCGCGA TCGCCAGCGC CTCTGACAGG GAATCCCGCA AAAAGGTGGC CGGTGACCTG
CAGCAGTACG TCCTGGAACA GGGCTACTTC ATCCCCCTCA ACCAGCTGGT TCAGCGCCTG
TACCTGATCT CGCCCGCGGT CAAGGGCGTC CAGTACAACG GCCTGGCGTA CGCCAACTTC
TACACCGCCT GGGTGGCCAA GTGA
 
Protein sequence
MTPIEFRRRA FLAGITAITG SAVLTACGGP SATSSSDAGT PVDGGNITFL IQGYDTGWVS 
SKTSISSYEG NLWGQITDKL VYVDDKGQLS PWVAESWEEL NGAKDFVLHL KDGVTFSDGT
PLDAAAVVAN LNAWAKGAPD RGVSKVGLFP SSNFASAEAA DARTVKVSFS SPALGFIATL
AYHGCILLSP KTLALPVDAQ ADLAQEIGSG PFILKSWKQG DSYVLEKRKD YNWGPAALGH
TGPARLDTIT YKVIKDTSVR TSTVASGQAE VAFNVEPQEI DSLKAQGFTV GTPKYLGFVD
GFQVNTQAFP TNDPSVRQAI QHGIDREEIR NTVYTEDWDA ATTFIQGNVP EAGDYSSAFA
FDADKAKKLL DDAGWKPGPD GFRVKDGKVL EFPLTPNPYV PSTKAEDELI AQQLERIGIK
VNLKVVDVAG YAAIQASRPP LFQTSRSFVD VGTVAGVLTS QNNGENWFNL GTSDQKLNDL
STAIASASDR ESRKKVAGDL QQYVLEQGYF IPLNQLVQRL YLISPAVKGV QYNGLAYANF
YTAWVAK