Gene Arth_1892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1892 
Symbol 
ID4445581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2128001 
End bp2129884 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content62% 
IMG OID639689704 
ProductPTS system, beta-glucoside-specific IIABC subunit 
Protein accessionYP_831376 
Protein GI116670443 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG2190] Phosphotransferase system IIA components 
TIGRFAM ID[TIGR00830] PTS system, glucose subfamily, IIA component
[TIGR01297] cation diffusion facilitator family transporter
[TIGR01995] PTS system, beta-glucoside-specific IIABC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000024125 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACCC AAGAGAGTGC AAAGGCGATT CTCGAGCACG TCGGCGGCGC CGGCAACATC 
TCGAATCTGC ACCACTGCTC GACCCGTTTG CGCTTCACGC TCGCGGACGA GAACAAGACA
AATGAGACCG CCCTTAAAGC CATTCCTGGC GTCATCGGCG TCGTCAAGGG CTCATCCCAA
ACTCAGGTGA TCATCGGAAA CGGCGTGGGC GAGATATACG CAGCGGTCGA GAAGCTGCGC
GGCGGCAACC AGCCCGCGGC GCCTGCCGGG AAGCGCCCAT TCAGCTGGAA GCGCCTTGGC
GCGACGATCA TGGACTTCGT CGTCAGTGTT TTCACGCCGA TCATCCCCGC CATCGCGGGC
TCTGGAATCT TCAAATCGCT CCTCGTTCTC GCCTCCGCAC TCGGGTGGCT TGACGCGCGC
AGTGAAAACT ACAAGCTGTT GGCGGTCATC CCCGACGCCG TCTTCGGTTT CATACCACTC
CTCGTGGCCT ACACCAGCGC CAAGAAACTC AACGTCAACA TACCTCTAGC TATCGGCTTA
GTCGGCGTGC TGGTCTATCC GGCGTTCACC GCCCTGGCAA CCAAAGACGG CGGAATCGCA
CTCTTCGGCA TGACCGTCCC GGTCGTCACC TACACCGCCC AGGTCTTCCC CGCAATTCTC
GCGGTGCTGC TGCTCTCTGT GGTCGAGCGG TTCTTCACCA AAATCACCTG GGCCCCCATC
CGCGTTTTCT TCGTACCCAT GATGTGCATT GTTATCGTCG CCCCGGCGAC GATCTTCCTG
CTCGGACCGC TCGGCTTCTG GCTCGGAACC CTTCTGACAG GAGCAATGAC CGGCCTGCAC
GGCTCGTTCG GTTGGGTCGC TGTCATGCTG CTCGCTGGAG TGCTGCCGTT GATTATCTCC
GTAGGAATGC ACAAGGCCTT CATTCCTCCG ACCATCGCCA CGATTGCCGC CACCGGCCAG
GAATCGCTGT ACCAGCCGGC GTCGCTAGCG CACAACCTCA GTGAGGCGGG CGCGACATTC
GGTGTCGCCG TCCGGACCAA GAGCACTGCC CTGCGCGCAA CAGCCATTTC CGGTGGAATC
TCGGCACTCT TTGGTATCAC GGAGCCTGCA CTCTACGGCG TCACCCTTCA GAACCGCCGG
GCCTTCATCG CCGTCGTCGC GGGCAGCATG TCCGCAGGCG CATACATGGG CATCATGCAG
GTCGCCGGAT TTGTAGCAGT TGGTCCCGGA CTTGCAAGCA TCACATCATT CATCGACGCC
GAGAACCCCC AAAATCTCCT GAACGCCGTC ATCGGCCTGC TGATCGCCGT GGCTGTTTCA
TTCACGACCT CTCTCATCCT CTGGCGAGAC GACGCATCCG CTACGGTACG AGCACTCGGC
GATGTGAAGC TCCCGGCACT CAACAGCGCC AACGTGGTCA GCCCCATCTC CGGAGACATC
ATCGCACTGT CAGAAGTAAA CGACCCGGTG TTCTCTGCCG GCATCCTCGG GGAGGGTATC
GCCATACGCC CCACCGACGG CGCCGTGCGC GCTCCGATAG CAGGAGTGGT GACAGCCCTG
CTCGGCTCGA AGCACGCCAT CGGCATCCGA GGCGACGACG GCGTCGAAGT CCTCGTGCAC
GTGGGTCTCG ACACCGTGCA GCTCGGCGGC AGCGCCTTCA CCGCCCACGT GGCCATTAAC
GACAAGGTCG AGGCCGGGCA GCTCTTGCTC GAGGCCGACC TGACCGCGAT CACCGAAGCC
GGCTACGACA CCACCACCCC TGTGGTCATC GTGAACTCGG CCAGCTTCAC TGTCGCCCTC
ACCGCCTCCG GCTCGGTCAC GGCGGGCGAA CCGCTGCTGA GCGTCAACGA AAAAACCAAG
GAGAACGTCA ATGTCACTGC CTAA
 
Protein sequence
MTTQESAKAI LEHVGGAGNI SNLHHCSTRL RFTLADENKT NETALKAIPG VIGVVKGSSQ 
TQVIIGNGVG EIYAAVEKLR GGNQPAAPAG KRPFSWKRLG ATIMDFVVSV FTPIIPAIAG
SGIFKSLLVL ASALGWLDAR SENYKLLAVI PDAVFGFIPL LVAYTSAKKL NVNIPLAIGL
VGVLVYPAFT ALATKDGGIA LFGMTVPVVT YTAQVFPAIL AVLLLSVVER FFTKITWAPI
RVFFVPMMCI VIVAPATIFL LGPLGFWLGT LLTGAMTGLH GSFGWVAVML LAGVLPLIIS
VGMHKAFIPP TIATIAATGQ ESLYQPASLA HNLSEAGATF GVAVRTKSTA LRATAISGGI
SALFGITEPA LYGVTLQNRR AFIAVVAGSM SAGAYMGIMQ VAGFVAVGPG LASITSFIDA
ENPQNLLNAV IGLLIAVAVS FTTSLILWRD DASATVRALG DVKLPALNSA NVVSPISGDI
IALSEVNDPV FSAGILGEGI AIRPTDGAVR APIAGVVTAL LGSKHAIGIR GDDGVEVLVH
VGLDTVQLGG SAFTAHVAIN DKVEAGQLLL EADLTAITEA GYDTTTPVVI VNSASFTVAL
TASGSVTAGE PLLSVNEKTK ENVNVTA