Gene Arth_0961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0961 
Symbol 
ID4446519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1033975 
End bp1035366 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content67% 
IMG OID639688767 
Productmajor facilitator transporter 
Protein accessionYP_830458 
Protein GI116669525 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCTGGC TTGACCGTGA CCGCACCATC GCCCCGCCTG GATTCAACCG CTGGCTGGTA 
CCGCCCGCGG CGCTCGCCGT CCACCTCTGC ATCGGCCAGG CCTATGCCAC CAGCGTCTAC
AAGACGGCGC TGGTCAAGCA CTTCGGGGCC AGCCTGACGG AGATCGGCGT GATCTTCTCC
ATCGCCATCG TGATGCTGGG CCTCTCGGCC GCGATCATGG GCACGTGGGT GGACAGGAAC
GGCCCCCGCA AGGCGATGTT CACCTCGGCC ATGTTCTGGG CCAGCGGATT CCTGATCGGT
TCGGTGGGCA TCTTCACCCA CCAGCTGTGG CTCGTGTACC TCGGCTACGG CGTAGTGGGC
GGAATCGGCT TGGGCATCGG CTACATCTCG CCCGTGTCCA CCCTGATCAA GTGGTTCCCG
GACCGCCCCG GACTCGCGAC CGGCATGGCC ATCATGGGCT TCGGCGGCGG CGCGCTGATC
GCCAGCCCCG TCTCCACCGC GCTGCTCAAG CTGTACGATC CCAACTCCGG CGCCAAGGGC
TGGGTGGCCA GCGGCGATTC GGTGGGCAAG CTCTTCCTGA CGCTCGCCGT CGTCTACCTC
GCCTACATGC TCTTCGGCGC CCTCACCATC CGGGTCCCGG CCGAAGGCTG GCGCCCCGCG
GGATTCGACC CCGCCAAGGT CAAGGCCGCC AAGCTGGTCA CCACCGAGAA CGTCTCGGCT
AAGAACGCGA TCAAGACCCC GCAGTTCTGG CTGGTGTGGG TGGCGCTGTT CTGCAACGTC
ACCGCGGGCA TCGGCATCCT GGAACAGGCA GCGCCCATGA TCCAGGACTT CTTCCGAAAG
TCCGACGGCG TGTCCCTGGT CAGTGCCGGC GTCGCCGCCG GCTTTGTGGG GCTGCTTTCC
ATCGGCAACA TGGCCGGGCG CTTCGCCTGG TCCGCCACTT CCGACGTCAC GGGCCGCAAG
CGCATCTACA TGGTGTACCT GGGCGTGGGC GCCGTGCTCT ACACGGTGCT TGCGCTGGCT
GGATCCAGCA CCACCGTCCT GTATGTGGCG CTGGCGTTCT TCATCATCTC CTTCTACGGC
GGCGGATTCG CCACGGCACC GGCCTACCTG CGGGACCTCT TTGGCACCTT CCAAGTGGGC
GCCATCCACG GCCGGCTGCT GACCGCCTGG TCCGCCGCCG GCGTGGCCGG GCCGCTGATT
GTCAACGCGT TCCTGGACGC GCAGGGCAAA CCCGGACAGC TGAACGCGGC GTCCTACCAG
CCGGCGCTGC TGACCATGGT GGCGCTGCTG GTGGTCGGCT TCGTCGCCAA TCTGCTGGTC
AAACCGGTGG ACGCACGGTT CCACGAACTC CGCACCGACC GCCGTCGGCC CGAACCTGCC
TTGGAGGCCT GA
 
Protein sequence
MGWLDRDRTI APPGFNRWLV PPAALAVHLC IGQAYATSVY KTALVKHFGA SLTEIGVIFS 
IAIVMLGLSA AIMGTWVDRN GPRKAMFTSA MFWASGFLIG SVGIFTHQLW LVYLGYGVVG
GIGLGIGYIS PVSTLIKWFP DRPGLATGMA IMGFGGGALI ASPVSTALLK LYDPNSGAKG
WVASGDSVGK LFLTLAVVYL AYMLFGALTI RVPAEGWRPA GFDPAKVKAA KLVTTENVSA
KNAIKTPQFW LVWVALFCNV TAGIGILEQA APMIQDFFRK SDGVSLVSAG VAAGFVGLLS
IGNMAGRFAW SATSDVTGRK RIYMVYLGVG AVLYTVLALA GSSTTVLYVA LAFFIISFYG
GGFATAPAYL RDLFGTFQVG AIHGRLLTAW SAAGVAGPLI VNAFLDAQGK PGQLNAASYQ
PALLTMVALL VVGFVANLLV KPVDARFHEL RTDRRRPEPA LEA