Gene Arth_2822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2822 
Symbol 
ID4444618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3172511 
End bp3174154 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content62% 
IMG OID639690644 
Productextracellular solute-binding protein 
Protein accessionYP_832301 
Protein GI116671368 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.630772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTTT CGCGCACTTC CAAAGCACTG GGCATGGTTG CCATCGCGGC CCTGGCCCTG 
ACCGGATGCG GAGCCGGAGA CGGCTCCACC ACCGGTAGCG GCAGTACCGG TGGCGACACC
AGCAAAGTAA TCCTGGCCGA CGGCTCCGAG CCCCAGCGTC CCCTCATGCC GGCCGACACC
AACGAGGTCG GCGGCGGCAA GGTCATCGAC ATGATCTTCG CCGGCCTTGT CAGCTATGAC
CCCAGCGGCA AGCCCGTCAA CGAACTGGCC GAGTCCATTG AAGGCAAAGA CGGCCAGCAC
TTCACCATCA AGATCAAGAA GGACCAGAAG TTCACGAACG GCGAGGCCAT CACGGCCAAG
TCATTCGTCG ATTCCTGGAA CTTCGGGGCT GCTGCCAAGA ATGCCCAGTT GAGCGGGGGC
TTCTTTGAAA GCATTGCCGG CTACGACGAG GCCAGTGCAG AAGGCTCCAC CGTGGAGACC
ATGTCCGGCC TCAAGGTCGT TGACGACCAG ACGTTCACGG TTGAACTGAA GCAGCCCGAA
TCCGACTGGC CGCTTCGACT CGGCTACACC GCCTTCGTTC CGGTTCCCTC CGGCGCGCTG
AAGGACCCGA AGGGTTTCGG CGAGAAGCCG GTCGGCAACG GCCCGTACAA GCTGGCGGAC
GGCGGTTGGC AGCACAACGT GCAGATCCAG CTCGTGCCGA ACCCGGACTA CAACGGCCCG
CGCAAGGCCA AGAACGCCGG CGTGACCTTC AAGATCTTCC AGAACGACGA CGCCGCGTAC
CAGGACCTGC TGTCCAACAA TCTGGACATC CTGCAGACTA TCCCCACCAG CGCCTTGAAG
AACTTCAAGA CCGACCTGGG TGACCGCACC ATCAACAAGC CGTACGCCGG CAACCAGACC
ATTGCCATCC CGGAATACCT GCCGGAATGG AGCGGTGAGG CAGGCAAGCT TCGTCGCCAG
GCCATCTCCA TGGCCATCAA CCGGGAAGAG ATCACCAAGG TGATCTTCAG CGGTGCACGC
CAGCCCGCCA AGGACTTCAC TGCTCCCGTC CTTGACGGCT ACAGCGACTC GATCACGGGT
TCCGAGAACC TGACGTTCGA CGCCACGAAG GCAAAAGAAG CCTGGGCCAA GGCCGACGCC
ATCCAGAAGT GGGACTCCAA CGAGACCTTC ACTATTGCCT ACAACGCCGA CAAGGGCGGA
CACAAGGCCT GGGTCGAAGC CGTAGTGAAC CAGCTCAAGA ACACGCTCGG CATCAAGGTT
GAGGGCAAGC CGTACGCCAC CTTCAAGGAA GCCCGCAACG ACGCCACCGC CAAGACGCTG
ACCGGCTCCA TCCGCGCCGG CTGGCAGGCG GATTACCCGT CGCTGTACAA CTTCCTCGGA
CCGATCTACA AGACCGGTGC AGGCTCTAAC GACGCCAAGT ACGCCAACCC GACGTTCGAC
AAGGCCATCT CTGAAGGACT GGCCGCTTCC TCCGTCAGCG AAGGCAACAA GGCCATGAAC
AAGGCCCAGG AAATCCTCCT GGCCGACCTT CCGGCCATCC CCCTGTGGTA CCAGGTTGCA
CAGGGCGGCT GGAGCGACAA GGTCACCAAC GTTGACTACG GCTGGGACGG CGTCCCGCTG
TACTACAACA TCACTGGCAA GTAA
 
Protein sequence
MRFSRTSKAL GMVAIAALAL TGCGAGDGST TGSGSTGGDT SKVILADGSE PQRPLMPADT 
NEVGGGKVID MIFAGLVSYD PSGKPVNELA ESIEGKDGQH FTIKIKKDQK FTNGEAITAK
SFVDSWNFGA AAKNAQLSGG FFESIAGYDE ASAEGSTVET MSGLKVVDDQ TFTVELKQPE
SDWPLRLGYT AFVPVPSGAL KDPKGFGEKP VGNGPYKLAD GGWQHNVQIQ LVPNPDYNGP
RKAKNAGVTF KIFQNDDAAY QDLLSNNLDI LQTIPTSALK NFKTDLGDRT INKPYAGNQT
IAIPEYLPEW SGEAGKLRRQ AISMAINREE ITKVIFSGAR QPAKDFTAPV LDGYSDSITG
SENLTFDATK AKEAWAKADA IQKWDSNETF TIAYNADKGG HKAWVEAVVN QLKNTLGIKV
EGKPYATFKE ARNDATAKTL TGSIRAGWQA DYPSLYNFLG PIYKTGAGSN DAKYANPTFD
KAISEGLAAS SVSEGNKAMN KAQEILLADL PAIPLWYQVA QGGWSDKVTN VDYGWDGVPL
YYNITGK