Gene Arth_2548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2548 
Symbol 
ID4444874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2857465 
End bp2859219 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content64% 
IMG OID639690367 
Productextracellular solute-binding protein 
Protein accessionYP_832027 
Protein GI116671094 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0070372 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCAGGA TCGTTCAATT AGACTGGGGA CCGGTCCGCT GCCCGCGGTC CTTCCGGCCG 
CCCAGAGGTG ATCATCGAGT GCCCAGCAGT AAATCGCGCC GCATTGCAGT CAGTATCGGT
GCAGGGGTCC TGTCGCTGCT GCTGGCCGCA TGCACAGGAA CGCCTGCTCC GGCACCGGCG
TCGTCCTCTG CGGCTCCTGC CGGCACGGCA GCCGCTGGCC CTACCGCGAC GTTCACCTTC
GGCACGGCGG CCCAGCCGCT GGGCCTTGAT CCCGCGCTGG CATCCGATGT GGAGTCCTAC
CGGATCACCC GCCAGGTCCT GGAGGGCCTT GTGGGAGTGG ACCAGACCAC GGGCCTGCCC
ACTCCCCTGC TCGCCACTGA GTGGGCGGAA TCCAACGAAG GCCGCGCCTA CACCTTCAAG
CTCCGCGAGG GCGTCACCTT CCAGGACGGT ACACGCTTCG ATGCCGCGGC TGTCTGCGCC
AACTTCAACA GATGGTTCAA CTTCCCGGCG AGCCTGCGGA AGCAGGCGCC CGGCACATCG
TTCAAGGGCG TGTTCAAGGC CTACGCCGAC CAGGCATCGC TCTCCATCTA CAAGGGCTGC
ACCGCCGTCT CTGCCGGGAA CGTCCGGATC GACCTCACCC AGCCGTTCAC CGGGTTCCTC
CAGGCGCTGA CGCTCCCGGC CTTCGCCATA TCCTCCCCGA CGGCCATGGC GGCACAGAAG
GCGGACAGCC TCAGCCAGAC CCGGGACGGC CAGCCCGTGT CGGCCTATGC CCTGCACCCT
GTGGGCACGG GTCCCTTCAG CTTCGCCGCG TGGCAGGACT CCAGCGTCAA GCTGGTCAGC
AACAAGGACT ATTGGGGTGA CAGGGGGCAG ATCGGCACCA TCAACTTCGT CACCTACGAT
CACCCGCAGT CCAGGCTCCA GGCCCTCCTC GACGGGAAGA TCGACGGCTA TGACGCCGTC
ACTGTGGGCA ACTTCGACCA ACTCGTCAAA CGCGGGCAGC AAATCATCCA GCGCGACCCG
TTCTCCGTGA TGTACCTGGG CATGAACCAG GAAGTGCCCA TCCTGCAAAA CATCAAAGTG
CGCCAGGCCA TCGAGATGGC GGTGGACAAG GAAACGCTGA TCCGCCGGTT CTTCATCGAC
AACACTGCCC AGGCAACCCA GTTCGTCCCG CCCAAGCTCA GCGGGTTCAA CAACAACGCC
CCCTCACTGG GCCACGACCC GGCCAAGGCC AAGGCGCTTC TGGAGGAAGC CGGGTACAAG
GGCGAGGAAC TCAAGTTCTA CTACCCCCTC AATGTCACCA GGCCATACCT TCCCACACCC
GAAAAGGTGT ACGCAGAGCT CAGCAGGCAA CTTACCGCTG TGGGCCTGAA CATCAAGCCG
GTTCCGGTGG AGTGGTCGGA CGGGTACCTG CAAAAAGTCC AGTCACCAGG GGACCATGCC
CTGCACCTGC TCGGCTGGAA CGGTTCCTAC TCGGATCCGG ACAATTTTGT GGGTCCCCTG
TTTGGCGAGA AGACCGGTGA ATTCGGCTAC CAGGACCCGC AGGTCTTTTC GAAGATCGCC
CGGGCACGCG GCTTGCCGGA GGGCGAGGAG CGGACGCAGC AATACCGCAC CATCAACGCC
CAGATCGCCG AATCGGTCCC CGCCGTCCCC ATTGCTTTCC CCATTTCAGC TCTGGCGCTC
TCCGACCGGG TGCTGAAGTA CCCTGCCTCG CCGGTATTAA ACGAGGTTTT CACAAAGGTG
GAGCTAAAAC CTTGA
 
Protein sequence
MSRIVQLDWG PVRCPRSFRP PRGDHRVPSS KSRRIAVSIG AGVLSLLLAA CTGTPAPAPA 
SSSAAPAGTA AAGPTATFTF GTAAQPLGLD PALASDVESY RITRQVLEGL VGVDQTTGLP
TPLLATEWAE SNEGRAYTFK LREGVTFQDG TRFDAAAVCA NFNRWFNFPA SLRKQAPGTS
FKGVFKAYAD QASLSIYKGC TAVSAGNVRI DLTQPFTGFL QALTLPAFAI SSPTAMAAQK
ADSLSQTRDG QPVSAYALHP VGTGPFSFAA WQDSSVKLVS NKDYWGDRGQ IGTINFVTYD
HPQSRLQALL DGKIDGYDAV TVGNFDQLVK RGQQIIQRDP FSVMYLGMNQ EVPILQNIKV
RQAIEMAVDK ETLIRRFFID NTAQATQFVP PKLSGFNNNA PSLGHDPAKA KALLEEAGYK
GEELKFYYPL NVTRPYLPTP EKVYAELSRQ LTAVGLNIKP VPVEWSDGYL QKVQSPGDHA
LHLLGWNGSY SDPDNFVGPL FGEKTGEFGY QDPQVFSKIA RARGLPEGEE RTQQYRTINA
QIAESVPAVP IAFPISALAL SDRVLKYPAS PVLNEVFTKV ELKP