Gene Arth_3143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3143 
Symbol 
ID4444256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3527159 
End bp3528580 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content66% 
IMG OID639690969 
Productextracellular solute-binding protein 
Protein accessionYP_832621 
Protein GI116671688 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.536467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGAG CCACAAACGG GCACCTGCGG GAAATCACCC GTCGTACCGC CCTTGGTGCC 
CTGGGCGCGG GAATCATCGG AGCAACCGTG GCGTCCTGGC CGCGGCTCTC CGGATCGGAC
ATTCCGGGCC GAGGGGACAA CAGCCTCAGT ATCGCCATCA TGGGCACCGC CGCGGACGCC
GCCGCCCGCC AGCGCGCCAT CGACGCCTTC ACCCGCCTCC ACCCGGAGAT CAGGGTCAAA
GTCCAGGCCA TCCAGGCCGT CGACTGGAAG GACTTCTTCA CCAAGATCCT CACCATGGTG
GCCGCGGGCA CCCCGCCGGA TGTGGTCTAC GTGGCCACTG AAGGCGCCCA GCTGTTCGCT
GAAAAGCTTG CCCACCCGCT GGACGAGTAC GTGCGCCGCG ACGCCGCGGA CATGGCCGAG
TTCTTCGACG ACGTCCACCC CAGCCTGGTG GAGGCCTTCA TGTACAAGGG CAGCCTGTAT
CAGCTCCCGA TGGACTGGAA CGCCGCCAAC ATGTACTACA ACACCACCGC GTTCGCGCAG
GCAGGATTGG AGCGCCCGGC GGATGACTGG ACCCACATGG ACTTCCGCAA CAGCCTCGCC
GCCATGCGGA AAGCCCGGAC CTCGGACTTC ACGCCCTACT ACTGGACCAA CCGGCTCTTC
GGCGGAGTGG TGCCGTGGCT CTACGCGAAC GACACCAGCT TCCTGAAGGA GACCAGGTCC
GCCGGTGGAG AGTGGCTTTG GGACGGCTTC TACGCCAACG ATCCCTCCCG CGGCCTCCGC
TCCGGCGGCT ACCAGTGGCT GGAACCCAAC GCCAATGACG ACCGCGTGTT CGAGTCCTTC
GACTACCTCC GCGGACTGGT CAAGGACGGG CTGGGCGTCC GCCCCGAGGA AGGCGGCGGC
AGCTCACTGG TGGGACTGTT CGCATCCAAC CGCATCGGGA CCACCCCCGC CGGCGGCTAC
TGGGTGCAGG GCCTGCACGA AGCCGGGATG GGCGAAAGCG ATTTCGACGT GCAGTTCTTC
CCGCGCTGGA AGAGCCAGCG CCACCAGTTC GGCACCGCGG GCTACGCGAT CATGAAGACC
GCGAAGGACA AGGACGCCGC CTGGGAATGG ATCAAGTTCA GTTCCAGCCG CGAGGCCATG
GAACTGATTT TCCCCAACCC GATTACGACG CCGGCGCGCC GCTCCATGGT GAACGAGCAG
CTTTACGCGG GCAAGGGGCC CGCCCATTGG AAGGTCTTCT ACGACACCCT GGACCGTTTC
CCCACCACCG GCCCCATTCC GGCACCACCC CAGCAGGCGG CCGTCGAAAC GGCCCTGATG
AAGAACGTAT CGCTCGCAGT CAGCGGCGAC GAGCGCCAGC TCAAACAGGC CCTCGCCTCC
ATGCAGCGCG ACCTTGAACT GGCCCTGAGG AGGCAGTCAT GA
 
Protein sequence
MTGATNGHLR EITRRTALGA LGAGIIGATV ASWPRLSGSD IPGRGDNSLS IAIMGTAADA 
AARQRAIDAF TRLHPEIRVK VQAIQAVDWK DFFTKILTMV AAGTPPDVVY VATEGAQLFA
EKLAHPLDEY VRRDAADMAE FFDDVHPSLV EAFMYKGSLY QLPMDWNAAN MYYNTTAFAQ
AGLERPADDW THMDFRNSLA AMRKARTSDF TPYYWTNRLF GGVVPWLYAN DTSFLKETRS
AGGEWLWDGF YANDPSRGLR SGGYQWLEPN ANDDRVFESF DYLRGLVKDG LGVRPEEGGG
SSLVGLFASN RIGTTPAGGY WVQGLHEAGM GESDFDVQFF PRWKSQRHQF GTAGYAIMKT
AKDKDAAWEW IKFSSSREAM ELIFPNPITT PARRSMVNEQ LYAGKGPAHW KVFYDTLDRF
PTTGPIPAPP QQAAVETALM KNVSLAVSGD ERQLKQALAS MQRDLELALR RQS