Gene Arth_4004 details

Gene Information

Locus tag: Arth_4004
Symbol:
ID: 4447267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4519928
End bp: 4521901
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 63%
IMG OID: 639691835
Product: PTS system, mannitol-specific IIC subunit
Protein accession: YP_833479
Protein GI: 116672546
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2213] Phosphotransferase system, mannitol-specific IIBC component
        [COG4668] Mannitol/fructose-specific phosphotransferase system, IIA domain
TIGRFAM ID: [TIGR00851] PTS system, mannitol-specific IIC component
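
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the gene length counts the terminal stop codon unless
    has_stop_codon is set to False.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    gene_length = span_length(4519928, 4521901)
    print(gene_length)                           # 1974
    print(expected_protein_length(gene_length))  # 657
```

Under these assumptions the computed values agree with the Gene Length (1974 bp) and Protein Length (657 aa) fields.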


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.852778
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACAG AGACAGTTGC AAAACCCCGC ACCAGCCTGC GGGTTGGCGT CCAGAAATTC 
GGGACGTTCC TGTCCGGAAT GATCATGCCC AACATCGGCG CTTTCATCGC CTGGGGCATC
ATCACGGCCT TCTTCATTCC GGCGGGCTTT ACTCCCAATG AGGAACTGGC CAAGCTCGTT
GGCCCGATGA TCACCTTCCT GCTTCCGCTC CTGATCGGCT ACACCGGCGG TCGCATGGTC
CACGGCGTCC GTGGCGGCGT TGTCGGCGCG GCCGCAACTA TGGGCGTGAT CGTCGGTACG
GACATCCCCA TGTTCATCGG CGCCATGATC ATGGGCCCGC TGACCGCATG GATCATGAAG
AAGCTGGACA AGATCTGGGA AGGCCGGGTC AAGCCGGGCT TCGAGATGCT GATCGACAAC
TTCACCGCAG GCATCGTGGC AGCAGCCATG GCCATCGTGG GCATGCTGGT GATCGGCCCG
GTGGTGAAGG CCTTCAGCAA CGGCGCCAGT TCCGTCGTCG AATTCCTGGT CAACAACGGC
TTGCTGCCGT TCACCAGCAT CTTCATCGAG CCGGCCAAGG TACTGTTCCT GAACAACGCC
GTGAACCATG GCATCCTGAC GCCGCTGGGT ACGGAACAGG CACTGCAAAA CGGAAAATCC
ATCCTGTTCC TGCTCGAAGC CAACCCGGGT CCCGGCGTGG GCATCCTGCT TGCGTACATG
ATCTTCGGCA AGGGCCTGGC CAAGGCGTCA GCCCCCGGCG CCGCCCTGAT CCAGTTTGTT
GGCGGTATCC ACGAAATCTA CTTCCCGTTC GTACTGATGA AGCCCATCAT CATCCTGGCC
GCAATCGGAG GCGGGATGAC GGGCATCTTC ACCCTGGTGC TCACCGGCGC AGGCCTGCGC
TCCCCGGCCG CCCCGGGCAG CATCATCGCC GTCTTCGCCG CGACCGCCAG CGACAGCTAC
TTCGGAGTGG CGCTGTCCGT GCTGCTCGCC GCCACGGTGT CCTTCCTGAT CGCTTCGGTG
ATCCTGAAGT CCAGCAAGAC CCCCGTGGGC GAAACCGAGG AGGACAGCCT GAGCGCCGCC
ACCTCCCGGA TGGAGTCCAT GAAGGGCAAA AAGAGCTCCA TCTCCTCCAC CCTGACCGGT
GCGGGAGCAA CGACGGCCGT TATGGCTGGC CCCATCAAGA ACATCGTGTT TGCCTGCGAC
GCCGGCATGG GCTCAAGCGC CATGGGCGCT TCGGTTCTGC GGAACAAGAT CAAGGCGGCC
GGCTTCCCCG ACGTCAAGGT CACCAACTCC GCCATTGCGA ACCTGAGCGA CACCTACGAT
GTGGTCATCA CCCACCAGGA CCTGACCGAG CGGGCCAAAC CCGCCACGGG CAGCGCCGTG
CACGTATCCG TGGACAACTT CATGAACAGC CCGCGCTATG ACGAGATCGT GGAGCTGGTC
AAGAGCAGCA ACACCGAAGG AACGGCTGGC GCCGCTGCTC CCGCTGCCGC TGCGGCGCCA
GTGGCGACTG CAGCCCCGTC AGCTGCCGAA GCCGCAACGC CGTCGGACAT CCTGGTGGCT
GACAGCGTTG TGCTCAATGG CACGGCCACC ACCCGCGACG CCGCAATCGA CGAAGCGGGC
CGGCTGCTGC TGGACCGCGG CGCCGTGGAC AGTGGCTACA TCGATGCCAT GCACGAACGC
GAGGAATCCG TGTCCACGTA CATGGGGAGC TTCCTGGCCA TTCCGCACGG CACCAACGCC
GCCAAGGACC ACATCATGAA GTCCGCCGTG TCCGTGATCC GTTACCCGAA CGGCATCGAC
TGGAACGGCA AGGAGGTCAA GTTTGTGGTG GGCGTGGCCG GCATCAACAA CGAGCACCTG
CAGATCCTGT CCTCCATCGC GAAGGTGTTC ACCAACAAGG CCCAGGTGGC ACAGCTCGAG
GCGGCCACCA CGGTTGACGA AGTGCTGGAA CTGTTCGGAA AGGTCAACGC ATAG
 
Protein sequence
MATETVAKPR TSLRVGVQKF GTFLSGMIMP NIGAFIAWGI ITAFFIPAGF TPNEELAKLV 
GPMITFLLPL LIGYTGGRMV HGVRGGVVGA AATMGVIVGT DIPMFIGAMI MGPLTAWIMK
KLDKIWEGRV KPGFEMLIDN FTAGIVAAAM AIVGMLVIGP VVKAFSNGAS SVVEFLVNNG
LLPFTSIFIE PAKVLFLNNA VNHGILTPLG TEQALQNGKS ILFLLEANPG PGVGILLAYM
IFGKGLAKAS APGAALIQFV GGIHEIYFPF VLMKPIIILA AIGGGMTGIF TLVLTGAGLR
SPAAPGSIIA VFAATASDSY FGVALSVLLA ATVSFLIASV ILKSSKTPVG ETEEDSLSAA
TSRMESMKGK KSSISSTLTG AGATTAVMAG PIKNIVFACD AGMGSSAMGA SVLRNKIKAA
GFPDVKVTNS AIANLSDTYD VVITHQDLTE RAKPATGSAV HVSVDNFMNS PRYDEIVELV
KSSNTEGTAG AAAPAAAAAP VATAAPSAAE AATPSDILVA DSVVLNGTAT TRDAAIDEAG
RLLLDRGAVD SGYIDAMHER EESVSTYMGS FLAIPHGTNA AKDHIMKSAV SVIRYPNGID
WNGKEVKFVV GVAGINNEHL QILSSIAKVF TNKAQVAQLE AATTVDEVLE LFGKVNA
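
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: it strips the spaces and line breaks from the 10-character blocks, computes the GC fraction, and translates the reading frame codon by codon. To my understanding, translation table 11 assigns the same amino acids to codons as the standard table and differs only in the start codons it permits, so a standard codon table is used here. The helper names (`clean_sequence`, `gc_fraction`, `translate_cds`) are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a coding sequence in frame, dropping one terminal stop codon."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = "ATGGCAACAG AGACAGTTGC AAAACCCCGC"
    print(translate_cds(first_blocks))  # MATETVAKPR
    print(round(gc_fraction(first_blocks), 2))
```

Translating the first three blocks reproduces the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.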