Gene Arth_4360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4360 
Symbol 
ID4443471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008538 
Strand
Start bp99119 
End bp100375 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content63% 
IMG OID639687681 
Productmajor facilitator transporter 
Protein accessionYP_829378 
Protein GI116662324 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0242739 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCATCAT CCTGCAAAGG CCAGACCGTG GGCCTGTCCG GCGCTGACAG AATCCTGAGC 
ATTGCCGCGA CGGTATTGTT CTCGTTCGCC CTTGGAACCC TGGCAGTGGT GGTGCCGATA
TTGGCCATTG CCGTGGGGTA CAGCACGGTC GAAGTCGGAA TGATGGTTGC ACTTGCAGCA
GTCTCGCAGT TAGTGACCCG GATCTTCCTG GGAGCACTGA TGCGCAGGGT CCCAGACAAA
GCCCTCCTCG TGGGAGCAGC GCTCATGATT GCCGTCTCGT GCGCTCTGAT CGCAGTCTCA
GATGCATTGG CAATTTTCGT AGTATCCCAG CTGGTCCAGG GCTCGGCTCG AGCCTTGTTC
TGGACCAGCA GCCAGACGCA CGCCGTCCGC ATGTCGACCT CGTCTGTCAA GGGACTGACT
GACGTGAATC TCGCCGCAGG GGTCGGAGCA CTGCTGGGAC CTGCACTTGC CGGTTATCTC
TGGGAACTTT CGACACCGCT GCCGCTGATC GTGGCGGCAG CGGCAGGATC GGCGGCCGTC
ATCCCTGCTG CCCTTCTGAC TAGGCTTCCT GTGTTCGCTC CGGAGCATGC CACAGGCGGA
ATAATGGCGC GAGGCCTCTG GCGGCGCCCT GGCGTCGATG CCGCCTGCTG GATGAACGCT
GGAGCGGGCG CCTGGAAGAG CCTCTTGGAC TCCTACGTTC CCATCGTGCT CTCCCTTGCC
GGCCAACCCG TCGCCGTCAT CGGAATCCTG GTAGCCATAC CCAACGCTGC CGTCCTGGCC
GGAAGTGCAT CTGCGAGCTG GCTGCGCAGG CGGGGAAACA GGACGTCCCT TCTCACGGGC
CTCCTGGCCA CGGGGACGGG CTTGGCCGCG GCGGGGCCAC TGGCCGGCGC CGCAGTTGCC
GCCGCCGCCG CCCTCGCTGT TTCCGGCGTT GGTGCCGGGA TCCTCCAGAC TGTCGGACCG
GCCATAGCTG CAGATGAAGT GCACCCGGAA GAACGCGGGG ATGCTCTCTC CCTGACTGGC
ACGGTACGGG CTTCAGCGCT CTTCCTAACA CCGTTTGTGA TGGCACTGCT GGTCAGCGTG
GTCCCTGTTG CCGCAGCGCT GGTCACGGCC GGCGTCCTCA TGACCCTTCC AGCAGCTGGA
AGTATCCGCA GAAAAGACCT TCCAGCGCAG ATCGCGCCAA CAGAATCAGG TCCAGGCAGG
CCGCTCCAAC AAACTCAGCT TGAAGCTCCC GAGAAAAAGG TGCCCGATGA CAACTGA
 
Protein sequence
MPSSCKGQTV GLSGADRILS IAATVLFSFA LGTLAVVVPI LAIAVGYSTV EVGMMVALAA 
VSQLVTRIFL GALMRRVPDK ALLVGAALMI AVSCALIAVS DALAIFVVSQ LVQGSARALF
WTSSQTHAVR MSTSSVKGLT DVNLAAGVGA LLGPALAGYL WELSTPLPLI VAAAAGSAAV
IPAALLTRLP VFAPEHATGG IMARGLWRRP GVDAACWMNA GAGAWKSLLD SYVPIVLSLA
GQPVAVIGIL VAIPNAAVLA GSASASWLRR RGNRTSLLTG LLATGTGLAA AGPLAGAAVA
AAAALAVSGV GAGILQTVGP AIAADEVHPE ERGDALSLTG TVRASALFLT PFVMALLVSV
VPVAAALVTA GVLMTLPAAG SIRRKDLPAQ IAPTESGPGR PLQQTQLEAP EKKVPDDN