Gene Arth_3936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3936 
Symbol 
ID4444811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4446251 
End bp4448203 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content68% 
IMG OID639691767 
Productmajor facilitator transporter 
Protein accessionYP_833411 
Protein GI116672478 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCCAT CCCGCAATAG ATCCATGTCC CCCGGTGCCA TCACGGCAGT CCTTGCGCTG 
AGCGGCACGG TGGTGGCGCT GATGCAGACC CTCGTGGTGC CGCTCCTTCC GGATTTCCCC
GGAATCCTCG GCGTTACGGC CGACGACGCG TCCTGGCTGG TCACCGCCAC GCTGCTCTCC
AGCGCCGTGG CCACCCCGAT CGTGTCCCGC AGCGCGGACA TGTACGGCAA ACGTAAAATG
ATGGTGATCT GCCTCGCCAT CATGGTTGCC GGCTCCATCG TGGCCGCGGT GGGGGGAAGC
TTCCTCTGGC TTATCGTCGG CCGGGCACTG CAGGGGTTCT CCTCGGCCTT GATTCCCGTA
GGCATCAGCA TCATGCGCGA CGAACTGCCC AAGGAAAAGA TGGGATCCGC CGTAGCCCTG
ATGAGCGCCA CGCTGGGCAT CGGCAGCGCA CTGGGCCTCC CGCTGGCCGG GCTGCTTTAC
GAAAGCCTGG GCTGGGAGTC CATCTTCTGG GTTTCCGGTG GCGCCGGCAC GCTGCTGCTC
GCTGCCGTCG TCCTGGTGGT TCCCGAATCC AAGGTGCGCA CACCGGGCCG CTTCGACTAC
CTCGGCGCAG TGATTCTTTC CGCGGCACTG GCAGCGCTGC TCCTGGGCAT TTCCAAGGGC
GGATCGTGGG GCTGGAGTTC CGAACCGGTG CTGCTTTTGT TCCTCGCCGC CGCCATCCTC
CTGGCCGCCT GGCTGCCCTA CGAGCTGAAG GTCAGCCAGC CGATGGTGGA CCTCCGCACC
TCCGGCCGGC GCCCTGTCCT CCTGACCAAC CTGGCATCCC TGCTGGTGGG TTTCGCCATG
TTCGCCAACA TGCTGCTGAC CACCCAGCAG CTCCAACTGC CCACCTCCAC CGGCTACGGT
TTCCAGCTCA GCGTGATCAC CGCCGGCCTC TGCATGGTTC CATCCGGCCT GGCCATGGTG
GTCTTCGCTC CCGTTTCCGG CGGCATCATC CGGCGGTTCG GCGGGAAGAC TGCGCTGATC
TCGGGCGCGG CGGTCATGGT GGTGGGGTAC GTTGGCCGCG TCTTCTTCTG GGATTCCATC
GCCTCGGTGA TCATCGGCTC CACGGTGGTC AGCATCGGCA CCGCCATCGC CTACGCCGCG
ATGCCCACCC TGATCATGGG GGCCGTGCCC ATCACCGAAA CAGCCTCGGC CAACGGCCTC
AACAGCCTGG TGCGGTCTAT TGGAACCTCG ACGTCGAGTG CAGCCGTCGC CGCCGTCCTC
ACCTCAGTGA CCATCACCGT GGGATCCGCC CGGCTGCCGT CCTTCGAGGC ATTCAAGGAC
GTCTTCTGGA TGGCCGCCCT GGCGTCCGCG GCCTCCATGG TGGCTGCCGT GTTCATCCCG
CGGGCCGCGG CCGCAGCCAA GGCGGCCCTC CCTGCGCCGG CCGCCACCGA ACTGGTGGTG
CAGGGGCGCG TCCTGACGGC CGACCGCCGC CCGGTCACTC CCGCCGTCGT CACCGTCCTG
CAGACAAGCG GCGAACCGGT GGACTGGAGC CGGGTGGACA GCGATGGCAA CTATTCCGTG
GCACTGCCCG GGGCGGGAAC ATATCTGATG GTGGCCAACG CCGCCGGCTG GGCGCCGATG
GCAGAGGTGT TCGACTTCGA CGGCCGCACG CTCCAGCAGA ACTTCCACCT GGAAAACCGC
CTGGAACTGG CCGGAACCGC CACGGTGGGA GGCACGGCCC TCACGGACGC GGTGGTCACC
CTGTTGCAGG CCTCCGGCGA ACACGTGGCG ACAGTCCGCA CCGATTCGGA GGGACGCTAT
TCACTGCCGC TGCCCTTGGC CGGGCGCTAC ATCGTGACCC TGCTGAACCC GGCGACCCAC
CAGGCCATCG CCCGGAAGCT GGCCGTGGAC AACCGGTCGG TGACCGCGGA CCTGGCGATG
GACGCCCCGG CCGGACAGCT GGTGGACGCG TGA
 
Protein sequence
MPPSRNRSMS PGAITAVLAL SGTVVALMQT LVVPLLPDFP GILGVTADDA SWLVTATLLS 
SAVATPIVSR SADMYGKRKM MVICLAIMVA GSIVAAVGGS FLWLIVGRAL QGFSSALIPV
GISIMRDELP KEKMGSAVAL MSATLGIGSA LGLPLAGLLY ESLGWESIFW VSGGAGTLLL
AAVVLVVPES KVRTPGRFDY LGAVILSAAL AALLLGISKG GSWGWSSEPV LLLFLAAAIL
LAAWLPYELK VSQPMVDLRT SGRRPVLLTN LASLLVGFAM FANMLLTTQQ LQLPTSTGYG
FQLSVITAGL CMVPSGLAMV VFAPVSGGII RRFGGKTALI SGAAVMVVGY VGRVFFWDSI
ASVIIGSTVV SIGTAIAYAA MPTLIMGAVP ITETASANGL NSLVRSIGTS TSSAAVAAVL
TSVTITVGSA RLPSFEAFKD VFWMAALASA ASMVAAVFIP RAAAAAKAAL PAPAATELVV
QGRVLTADRR PVTPAVVTVL QTSGEPVDWS RVDSDGNYSV ALPGAGTYLM VANAAGWAPM
AEVFDFDGRT LQQNFHLENR LELAGTATVG GTALTDAVVT LLQASGEHVA TVRTDSEGRY
SLPLPLAGRY IVTLLNPATH QAIARKLAVD NRSVTADLAM DAPAGQLVDA