Gene Arth_4342 details

Gene Information

Locus tag: Arth_4342
Symbol:
ID: 4443488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008538
Strand:
Start bp: 82080
End bp: 83306
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 66%
IMG OID: 639687663
Product: major facilitator transporter
Protein accession: YP_829360
Protein GI: 116662306
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
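
The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with one another. A minimal sketch of that check follows, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(82080, 83306)
print(length)                     # 1227
print(protein_length_aa(length))  # 408
```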


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACGTA CCGACGATTC TGCACCGGAC GATCCGCGGC GCCTTCGTCG CGACCACCAC 
CTCATCCAGT CAGCGAATCT CGGATTCTCG GTCGCGCAGG CAATGGCGGC CGTCGCCGTG
CCAATCCTGG CCGTGTACGC GGGGCACCCC ATCGAGCTGA TCGGGATCAT TGTCGCGGTC
TCTGCGGTGT CGCAGACCGT CGCGCGCCTT GGCATGGGCA CCCTGATGAG CCGGCTGCCC
ACCAAGCACT TCATTGCTGC CGCTACGCTG CTGCTGTCTG CATCCTGTTT CCTCCTCGGC
TTCAGCACCG AGTTGTGGGC CTTCATCATT GCCCAGCTCC TGCAAGGAGC CGCGCGCGCG
TATTTCTGGA CCGGAAGCCA GACGCATGTT GTCCGCGCGT CCGAGTCCGC CGTCACCGCG
CTCTCCCGCC TGAACGTGGT TCAGGGCGTA GGCCAGCTGA TCGGACCGGC ACTCGCGGGC
TTCATTGGTG CCTGGTCCCT GCAGATGTCC CTCCTGGCAG CGGGCGCACT CGCGGCGATC
GCGCTCGCGC CGGCGATCGC ACTCGTCAGA TTTGCCCCGT TCCCACAGCG GAGCCGCCAC
GGCACCGGGC GCCCTCGGCA GATCTGGCGT CAGCCAGGTG TCGGCATGGC TGCCAGCATG
GCGGCGGTCG CGGGTGCGTG GCGCGGCATC CTCAATTCCT ACCTGCCCGT TATCCTGACC
GCGGCCGGCC ACAGCATTCC CGTCGCCGGC GCGCTGATGA CGGTCGCCAA TCTCGCGTCC
CTTTGCGGCA GTGCGTTCTC CCGCCGCATC CATGCCGCGG GTCCGCGCGT CGCGAATGCG
ATCGGCACGG CGGGGGCAGG CCTCGGACTT GTGCTTGCGT CGTTTTTCCC GAATCCGATC
TGGGTAGTCG CTGTGGGGCT CACCATCTCG GGCCTCGGTG CCGGGATTCT GCAGACGGTC
GGTCCGGCAT TGGCTGTGGA TTCCATCAGC GAAGAGGATC GTGGGCGTGC CATCGCATCC
ATCGGGACGT TCCGGTCAAT ATCGCTGTTC GTGTCCCCTC TGGCGACCGC AGGGCTCATC
CTCATCGTTC CCAGCGCTGC TATCGCCGCG GGGATCGCCG GTATCATCAT TTCTACGCCA
ACCCTGTCTA CTCTGATCAG ACGCAGAGGC CAGGCGAGGA CGTCCCAGGA AGGCACCCAT
GACCACGACG AAGACTTTGC GAACTGA
 
Protein sequence
MSRTDDSAPD DPRRLRRDHH LIQSANLGFS VAQAMAAVAV PILAVYAGHP IELIGIIVAV 
SAVSQTVARL GMGTLMSRLP TKHFIAAATL LLSASCFLLG FSTELWAFII AQLLQGAARA
YFWTGSQTHV VRASESAVTA LSRLNVVQGV GQLIGPALAG FIGAWSLQMS LLAAGALAAI
ALAPAIALVR FAPFPQRSRH GTGRPRQIWR QPGVGMAASM AAVAGAWRGI LNSYLPVILT
AAGHSIPVAG ALMTVANLAS LCGSAFSRRI HAAGPRVANA IGTAGAGLGL VLASFFPNPI
WVVAVGLTIS GLGAGILQTV GPALAVDSIS EEDRGRAIAS IGTFRSISLF VSPLATAGLI
LIVPSAAIAA GIAGIIISTP TLSTLIRRRG QARTSQEGTH DHDEDFAN
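
The gene and protein sequences above can be cross-checked against each other. The sketch below assumes the whitespace in the listings is only formatting, builds the codon table from the standard NCBI base ordering (T, C, A, G), and translates up to the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as starts; this gene begins with ATG, so start codons need no special handling here.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGTCACGTA CCGACGATTC TGCACCGGAC"))  # MSRTDDSAPD
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence gives a complete check, and `gc_fraction` on the same input can be compared with the GC content field.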