Gene Arth_3881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3881 
Symbol 
ID4446839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4365897 
End bp4367426 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content68% 
IMG OID639691706 
Productmajor facilitator transporter 
Protein accessionYP_833356 
Protein GI116672423 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCACCCC CTCCGCCTTC CACGGCAACC GTCAGCCAGC CCGTCATCGA TTCCGCCGCA 
TCTCCGCCGC TGCAGACTGC ACCGCAGAAC GCACCGCAGA CCCTGCCGCA GCCGCGGGAC
GTCACTCAGC CGATCGCCGT CGTCAGCGAA CGCCTGCCCT GGCGCCACAC CTTCATTTCC
CTCCGCGTCC CCAACTTCCG CATTTTTGCC ATCGGACACT TCATCGCGGT GATCGCGCTC
TGGATGCAAC GGATCGCGCA GGACTGGCTC GTGCTGCAGC TATCCGGCTC CGTCACGGCG
GTCGGCATCA CCGTGGCGCT GCAGTTCATG CCGTCGCTGG TGCTGGGGCC GTGGGGCGGG
ATGATGGCGG ACAGGTTCCC CAAACGGAAG ATCCTCATCC TCTGCCAGTC AGTGGCCGCC
GTGCTTGCCG CCGCCCTTGC CGTCCTGGCG CTGAGCCAGC GCATCGAAGT GTGGCACGTC
TACGCGATCG CCCTGGTCCT GGGATTGGTC ACCGTGCTGG ACCAGCCGGC CCGGCAGGTC
TTCGTCAACG AACTCGTCGG CCCCACGTAC CTGCGCAACG CCATCAGCGT GAACTCCACG
ACCTTCCAGC TGGGCGGCCT GATCGGGCCC GCACTAGCCG GGCTCCTGCT GACCGCGGTG
GGTGCCGGCT GGGCCTTCGC CGCCAATGCC GTGGCCTGCT GTTCCACGGT GGCAATGCTG
CTGCTCCTGC GCAAGGACCA GCTGTTCATC ACTGCGCCCG CGCCGAAGCG CAAGGGCATG
CTCCGGGAGG GGCTACAGTA CGCGCTGAGC AAGCCCACCA TCTACTGGCC CTGGCTGATG
GCAGGGTTCA TCGCAGTTTT CGCCATGAGC CTGCCGGTGC TGCTGGCCGC CTTCGCGGAC
AACGTGTACG ACGCCGGCGC CGGAGGCTAC GGCCTGCTGA ACGCGCTGGT GGCGCTGGGT
GCGCTCGCCG GGGCTGTCAC CTCCACCCGC CGCCGGCAGC TGCGACTGCG GTCGGTGGTG
CTGGGTGCCG GAATGTACGG GCTGATGCTC TGCCTCGCGG CCCTGGCGCC GTCCATGGTG
TGGTTCGGCG CCGCCATGGT GCTCTCCGGA TTCTGGTGCC TGATGTTCCT AACCGCGGCC
AACCAGCTGG TGCAGATCAG TTCCAACATG GGAATCCGGG GACGCGTCAT GAGCCTGTAC
ATCATGGTGC TGATCGGCGG GCAGGCCATC GGCGGTCCCA TGATGGGCTG GATTGCCGAG
CACCTGGACC CGCACACCGC CATCCTCGTT TCCGGCGGGG TGCCGGTCCT GGCAGCGGTG
ACTGTCGCCG TCGTACTGGC CCGGCGCGGT GAGCTGACCC TCAAGGTGAA CCTCAGGGAC
CGGCACCACC TCATCCGGAT AGTCAGCCGG AAGGCAAAGA AGCCGGGGCG CGGGCCCTCA
GGGCCGGCGC CCCGGCCGGC TGCTACGCGT CCAGAGGGGC GTGCGGGTAC TCCGCCTGCC
GCTGCTGGAA CTGCAGCACG TCCTTGTTGA
 
Protein sequence
MAPPPPSTAT VSQPVIDSAA SPPLQTAPQN APQTLPQPRD VTQPIAVVSE RLPWRHTFIS 
LRVPNFRIFA IGHFIAVIAL WMQRIAQDWL VLQLSGSVTA VGITVALQFM PSLVLGPWGG
MMADRFPKRK ILILCQSVAA VLAAALAVLA LSQRIEVWHV YAIALVLGLV TVLDQPARQV
FVNELVGPTY LRNAISVNST TFQLGGLIGP ALAGLLLTAV GAGWAFAANA VACCSTVAML
LLLRKDQLFI TAPAPKRKGM LREGLQYALS KPTIYWPWLM AGFIAVFAMS LPVLLAAFAD
NVYDAGAGGY GLLNALVALG ALAGAVTSTR RRQLRLRSVV LGAGMYGLML CLAALAPSMV
WFGAAMVLSG FWCLMFLTAA NQLVQISSNM GIRGRVMSLY IMVLIGGQAI GGPMMGWIAE
HLDPHTAILV SGGVPVLAAV TVAVVLARRG ELTLKVNLRD RHHLIRIVSR KAKKPGRGPS
GPAPRPAATR PEGRAGTPPA AAGTAARPC