Gene Arth_3146 details


Gene Information

Locus tag: Arth_3146
Symbol:
ID: 4444259
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3530278
End bp: 3531552
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 64%
IMG OID: 639690972
Product: extracellular solute-binding protein
Protein accession: YP_832624
Protein GI: 116671691
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
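
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3530278
END_BP = 3531552
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))             # 1275
print(expected_protein_length(GENE_LENGTH_BP))   # 424
```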


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0629231
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCCGTC CGAACGCAAC GAAGGCAGCC GCCATAGGCC TCGCCGCAGC GCTGCTGATG 
ACCGGCTGCG GCAGGGACAC CGCGGGTTCG TCCCCAGCGT CATCGGCCAA GCCCATTGCC
TCAGGCCAGG CATCCGGCAC CATTACCCTG TGGGCCCAGG GCAGCGAAGG CGAAGCCCTT
CCGGCACTGC TCAAGGAGTT CGAGGCCGAG AATCCCGGCG TCAAGGTCAA CGTCACAGCC
ATCCCCTGGG ACGCGGCCCT CAGTAAGTAC CAGACCGCCA TCGCCGGCGG GACGACGCCG
GACGTCGCCC AGATGGGCAC CACCTGGATG GGCGATTTCG CCAACTCGTT CGATGCCACG
CCCAAAGAGA TCGACGCAAG CGACTTCTTC CCCGGCTCGG TGAAGTCAAC CGAAGTCGAA
GGAACCACCT ACGGGGTGCC GTGGTACGTC GACACCCGCG TGGTCTACTA CCGCAGCGAC
CTCGCGGAGA AGGCCGGCAT CACCAAGGCG CCCGAAACCT GGGATGACTT CAAGGCCCTT
GCCAAGGGCC TTCAAGAGAA GGCCGGGGCA AAATACGGGG TTCAACTGCC TGCCGGGGTC
GCCGGCTCCT ACCTCGACAC CCTCCCGTTC CAGTGGTCCA ACGGAGCGAA GTTGATGAAC
GACGACGGCA CCAAGTGGAC CCTCGACACT CCGGAAGCGG CAGAGGCCCT GAAGTATTAC
TCCAGCTTCT TCGCTGATGG GCTCGCGTCC AAGGCTGTCT CCACGGGAAC CACTGCCGAG
GCGTCCTTCG TGGACGGTTC CGCCCCCATG ATGATCAGCG GTCCCTGGCA CGTCGGCCTG
CTCAACAAGG CCGGCGGGGC AGGATTCGAG GACAAGTACA AGGTTGCCCC GATGCCCAAG
GCGAAGACCT CAACGTCCTT CGTCGGCGGC TCCAACATGG TGGTGTTCAA GAAGTCAGAG
AACCGCGATT CTTCCTGGAA GCTCCTGCAG TGGCTGTCCA AGCCCGAGGT CCAGCTCAAG
TGGTACAAGG CCACCGGCGA CCTCCCTTCG CAGCAGGGTG CCTGGAAGGA CCAGTCCCTG
GCAGGAGACA GCAAGCTCTC GGTCTTCGGC GACCAGCTCA AGACCACCAA CAACCCGCCG
GCCGTTTCCA CCTGGACCCA GGTTGCCGCC GCCGCCGACA GCGAAATCGA ACAGATCGTC
AAGGCCGGCA AGGACCCCGC GGAGGCACTG AAGTCCCTGC AGCAGGCCGC AGATTCGATC
GGCACCGGGA AGTAA
 
Protein sequence
MIRPNATKAA AIGLAAALLM TGCGRDTAGS SPASSAKPIA SGQASGTITL WAQGSEGEAL 
PALLKEFEAE NPGVKVNVTA IPWDAALSKY QTAIAGGTTP DVAQMGTTWM GDFANSFDAT
PKEIDASDFF PGSVKSTEVE GTTYGVPWYV DTRVVYYRSD LAEKAGITKA PETWDDFKAL
AKGLQEKAGA KYGVQLPAGV AGSYLDTLPF QWSNGAKLMN DDGTKWTLDT PEAAEALKYY
SSFFADGLAS KAVSTGTTAE ASFVDGSAPM MISGPWHVGL LNKAGGAGFE DKYKVAPMPK
AKTSTSFVGG SNMVVFKKSE NRDSSWKLLQ WLSKPEVQLK WYKATGDLPS QQGAWKDQSL
AGDSKLSVFG DQLKTTNNPP AVSTWTQVAA AADSEIEQIV KAGKDPAEAL KSLQQAADSI
GTGK
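
The relationship between the gene sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for every codon after the start codon, and it ignores table 11's alternative start codons because this gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Whitespace is stripped so the 10-base blocks can be pasted in as shown.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGATCCGTC CGAACGCAAC GAAGGCAGCC"
print(translate(first_blocks))  # MIRPNATKAA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) gives a quick consistency check, and `gc_fraction` on the full sequence can be compared with the GC content field.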