Gene Achl_3805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3805 
Symbol 
ID7295293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp4247024 
End bp4248796 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content62% 
IMG OID643592215 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002489847 
Protein GI220914538 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCCTA GATTTAGCCG ACCCACCCGC ACCACTCGCA TCAACGAAGG GAAAACCATG 
AGGAATCTGA ACAGGATCGG CGGAGCTGCA GCCATTGCTG CGGCTCTGGC GCTGACGGCC
TGCGGCAGTG GCGGCTCCAC GGGGCCGGAG ACAGCCAAGG GGCAGGAGGC AGGCAGCGAC
CTGTCCAAGC TGATCAGCAT CAACGAGAAG CCGGCAGCAG ATCTTGAGCA GGGCGGCAAG
GTCACCCTTC CGCTGGGCAA CATCGGCCCG GACTTCAACG GCTTCTCCAA CAACGGCAAC
AGCGCGGACA ACTCCGCCCT GCAGGTCCCC ATCAACCCGG TTGCCATGAA CAGCGGCGGC
ATGGGTGGCT GCTGGAAGGT CGACTTCACC GGAAAGGTCA CGCCCAACAC GGACTTCTGC
GAGTCCGTGG AGAGCGAGGT CACGGACGGC AAGCAGACCA TCACCATCAA GGTCAATGAC
AAGGCCACCT ACAACGACGG CACACCCATC GACGTGAAGG CGTTCCAGAA CACCTGGAAC
ATCCTCAAGA GCCCGGACAA CGGCTACGAC ATCGTCAGCT CCGGCGCCTA CCAGTTCGTC
GAGTCGGTAG AGGCCGGCAG CAGCGACAAG GAAGTCATCG TCAAGACCAC ACAGCCGGTC
TACCCCATTG ACTCGCTCTT CTTCGGCCTG ATCCACCCGG CCGTGAACAG CCCGGAGATC
TTCAACGAAG GCTTCAACGG CAACATGCAC GCCGAGTGGA TGGCCGGCCC GTTCAAGCTT
GACCAGTACG ACTCCGCTGC CAAGACGGTG ACTCTTGTTC CCAACGAAAA GTGGTGGGGC
AAGAAGCCGG TACTGGCCAA CGTCACCTTC CGCCAGCTGG AGAGCAGCGC CCAGATCGCC
GCCTTCAAGA ACGGTGAAAT CGACGCCGTT TCGGCCAACA CCATCACTCC GTACAAGCAG
CTGGAAGGCA CCAAGGACGC AGATATCCGC CGTGGCCAGC GCCTGTTCGC CGGCGGCATG
AACATCAATG CCCAGAAGGT CACCGACGTC GCCATTCGCG AAGCCATCTT CGCCGCCGTG
GATCGCGAAG CCCTCCGCAA GGTCCGCTTC AACGGCCTGA ACTGGGAAGA GCCCAGCTCC
GGCTCCATGA TGCTCCTGCC GTTCTCCGAG TACTACCAGG ACAACTACCC CGTGAAGGAA
ACAGGTCCGG ACGCTGCCAA GAAGGTCCTG ACGGATGCCG GCTACACGGC CAACGCCAAC
GGCATCATGG AGAAGGACGG CAAGCCTGCC GCCTTCAAGA TCAGCAACTT CGGTGACGAT
CCCACCACCC TGGCCTTCAC ACAGACCCTG CAGAAGCAGC TGCAGGCCGG CGGCATGGAC
GTCGGAATCG ACCAGCGTGC TTCCGCCGAC TTCGGCAAGG TCATCGGCGG CCGCGACTTC
GACATGAGCG TTTCGGGCTA TACCGTAGGC CCGGACGCCA CCGACGCGGT GAAGCAGTAC
TACGATTCCA AGACCAACGA GAACCAGCTG GGCGACGCCG AGCTGGACAA GAAGATCGCC
GACCTCGCCT CCATCGAGGA CAACGCCGAG CGCAACAAGG CGGCCATGGA GGTCGAAAAG
GAGCACATGG CGAAGTACTT CTCCATGGGC GTGGTCATGA ACGGCCCGCA GATCTCCTTC
GTCCGCACGG GCCTCGCCAA CTACGGCCCG TCGCTGTTCC AGAGCCTGTC CCAGGTTCCG
GACTGGACCA CCCTGGGCTG GGAAAAGAAG TAA
 
Protein sequence
MHPRFSRPTR TTRINEGKTM RNLNRIGGAA AIAAALALTA CGSGGSTGPE TAKGQEAGSD 
LSKLISINEK PAADLEQGGK VTLPLGNIGP DFNGFSNNGN SADNSALQVP INPVAMNSGG
MGGCWKVDFT GKVTPNTDFC ESVESEVTDG KQTITIKVND KATYNDGTPI DVKAFQNTWN
ILKSPDNGYD IVSSGAYQFV ESVEAGSSDK EVIVKTTQPV YPIDSLFFGL IHPAVNSPEI
FNEGFNGNMH AEWMAGPFKL DQYDSAAKTV TLVPNEKWWG KKPVLANVTF RQLESSAQIA
AFKNGEIDAV SANTITPYKQ LEGTKDADIR RGQRLFAGGM NINAQKVTDV AIREAIFAAV
DREALRKVRF NGLNWEEPSS GSMMLLPFSE YYQDNYPVKE TGPDAAKKVL TDAGYTANAN
GIMEKDGKPA AFKISNFGDD PTTLAFTQTL QKQLQAGGMD VGIDQRASAD FGKVIGGRDF
DMSVSGYTVG PDATDAVKQY YDSKTNENQL GDAELDKKIA DLASIEDNAE RNKAAMEVEK
EHMAKYFSMG VVMNGPQISF VRTGLANYGP SLFQSLSQVP DWTTLGWEKK