Gene Arth_2724 details


Gene Information

Locus tag: Arth_2724
Symbol:
ID: 4444589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3062826
End bp: 3064460
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 62%
IMG OID: 639690544
Product: extracellular solute-binding protein
Protein accession: YP_832203
Protein GI: 116671270
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
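
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship in Python, not the database's own method. It assumes that Start bp and End bp are 1-based and inclusive (the record does not state its coordinate convention) and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3062826, 3064460)
residues = protein_length(length)
print(length, "bp;", residues, "aa")
```

Under these assumptions the result agrees with the Gene Length (1635 bp) and Protein Length (544 aa) fields.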


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.117295
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATGA ACAAAAAGGC CCTGCACAGC GCTATCGCGC TCGCGGGTGT TTCCGCGTTT 
GCACTGACAG CCTGCACAGG TCCGTCCGGC GGCGGCGGAA CTTCCACCGG CGGCGCCGGA
GGCGGAACCA TTACCTACGG CACCACGGAC AAGGTCGTTA CCCTCGATCC TGCGGGCTCG
TACGACGCCG GTTCCTTCAT GGTGATGAAC CAGATCTACC CGTTCCTGCT GAACGCCAAG
CCCGGCACGG CGGACGCCAC ACCCGATATC GCAGAGTCCG CGGAATTCAC GAGCCCCACG
GAGTACACCG TCAAGCTCAA GTCGGACCTC AAATTTGCCA ATGGACACGC GCTCACCTCC
TCCGACGTGA AGTTCTCCAT CGACCGCGTG GTCAAGATCG CGGACGACAA CGGCCCTGCC
TCGCTTCTGG GCAACCTGGA GTCGGTTACC GCCAAGGACG ACTCCACGGT GGTCTTCAAG
CTCAAGGCCG GCAATGACCA GGTCTTCCCG GGCGTCCTTG CTGCCAATGC AGGACCCATC
GTCGATGAAG AGGTCTTCCC GGCGGACAAG CTCATGAGCG ACGACGAAAT CGTCAAGGGC
AAGCCGTTCG CCGGCCCCTA CACGATCGAG AGCTACAAGA AGAACGAGCT TGTGAGCCTG
AAGGTCAACC CGGACTACAA GGGCCTGCTG GGCAAGCCCG CCAATGACGG CGCGAGCATC
AAGTACTACG CCGATTCGAA CAACCTCAAG CTCGACGTCC AGCAGGGCAA CATCGACGTT
GCCGGCCGCA GCCTGACCGC TACGGACGCC GCTGACCTCG AAAAGGACTC CAAGGTCACC
GTCCACAAGG GTCCCGGCGG CGAGCTGCGC TACATCGTGT TCAACTTCGA CACCATGCCG
TTCGGAGCGA AGACCGCCGA GGCAGATCCC GCCAAGGCGC TCGCCGTCCG CCAGGCCATG
GCGAACGTCG TTGACCGCGA CGCCATCGCA ACCCAGGTCT ACAAGGGCAC CTACCTGCCC
GCGTACTCCG TAGTCCCCGA CGGGTTCGTC GGAGCCATCC AGCCGCTCAA GGAAATGTAC
GGCGACGGCA GCGGCAAGCC CAGCCTGGAC AAGGCCAAGA AGGCATTCTC CGAGGCAGGC
GTAACGGCCC CGGTCAACAT TAAGCTGCAG TACAACCCCG ACCACTACGG CAAGTCCTCG
GGCGACGAAT ACGCCATGAT CAAGGAACAG CTGGAGAAGT CCGGCCTCTT CAAGGTGGAC
CTGCAGTCCA CTGAATGGGT GACCTACTCA AAGGACCGCA CCAAGGACGT CTACCCGGTC
TACCAGCTCG GCTGGTTCCC GGACTACTCG GACGCGGACA ACTACCTGAC CCCGTTCTTC
GTACCGGGCA ACTTCCTGAA GAACCACTAC GAAAACCCGT CCGTGACGGA CCTGATCACC
AAACAGCTCA CCACTGTTGA CAAGGCAGAG CGCGAGAAGG TCCTGGGTGA AGCCCAGACG
TCAGTTGCCA AGGATCTCTC CACGCTGCCG CTGCTGCAGG GCGCCCAGCT CATGGTCGCC
GGAAAGGACG TCAAGGGTGT TGAAAAGACC CTGGACGCGT CCTTCAAGAC CCGTCTTGGC
GTGATTTCCA AGTAG
 
Protein sequence
MAMNKKALHS AIALAGVSAF ALTACTGPSG GGGTSTGGAG GGTITYGTTD KVVTLDPAGS 
YDAGSFMVMN QIYPFLLNAK PGTADATPDI AESAEFTSPT EYTVKLKSDL KFANGHALTS
SDVKFSIDRV VKIADDNGPA SLLGNLESVT AKDDSTVVFK LKAGNDQVFP GVLAANAGPI
VDEEVFPADK LMSDDEIVKG KPFAGPYTIE SYKKNELVSL KVNPDYKGLL GKPANDGASI
KYYADSNNLK LDVQQGNIDV AGRSLTATDA ADLEKDSKVT VHKGPGGELR YIVFNFDTMP
FGAKTAEADP AKALAVRQAM ANVVDRDAIA TQVYKGTYLP AYSVVPDGFV GAIQPLKEMY
GDGSGKPSLD KAKKAFSEAG VTAPVNIKLQ YNPDHYGKSS GDEYAMIKEQ LEKSGLFKVD
LQSTEWVTYS KDRTKDVYPV YQLGWFPDYS DADNYLTPFF VPGNFLKNHY ENPSVTDLIT
KQLTTVDKAE REKVLGEAQT SVAKDLSTLP LLQGAQLMVA GKDVKGVEKT LDASFKTRLG
VISK
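
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch in Python, assuming the blocks of ten bases are separated only by whitespace and that the reading frame starts at the first base. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, the sketch does not handle alternative starts. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (same as the standard code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()  # remove spaces and line breaks
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence from this record.
first_block = translate("ATGGCAATGA ACAAAAAGGC CCTGCACAGC")
last_block = translate("GTGATTTCCA AGTAG")
print(first_block, last_block)
```

The first thirty bases translate to the first ten residues of the protein sequence, and the final line of the gene sequence translates to its last four residues followed by the stop codon.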