Gene Arth_0365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0365 
Symbol 
ID4447159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp386190 
End bp387569 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content64% 
IMG OID639688161 
Productgeneral substrate transporter 
Protein accessionYP_829866 
Protein GI116668933 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTAG AACAACGCTC TGCAACACCG GCCGCTTCCG CCCCAAAGGG CGCCGGCCTG 
AAGAAAATCG TCGCAGCCTC CATGGTTGGC ACCGTAGTGG AATGGTACGA ATTCTTCCTC
TACGCCACTG CAGCAACCCT GGTGTTCGGA AAGTACTTCT TCCCGGCCAC CGGCAATGAG
CTGGACGGCA TCATCCAGGC CTTCATCACC TACGCGGTCG GCTTCGTGGC CCGCCCGCTC
GGCGGCATCG TCTTCGGCCA GATCGGAGAC AAGCTGGGCC GCAAACCCAC CCTTCAGCTC
ACCATCGTGA TCATCGGTGT GTCCACCTTC CTCATGGGCT GCCTCCCAGG CTTCGCTGAG
ATCGGCTACC TTGCCCCGGC ACTCCTGGTG TTCCTGCGCT TCATCCAGGG CTTCGCGCTG
GGCGGCGAAT GGGGCGGCGC CGTGCTGCTC GTCGCCGAGC ACAGCCCCAA CGAGTCCCGC
GGATTCTGGT CCAGCTGGCC CCAGTCCGCC GTTCCCGTAG GCAACCTGCT GGCCACCCTG
GTCCTGTTCA TCATGTCCAA CGTCCTCAGC AGCGCCGACT TCCTCAGCTG GGGCTGGCGC
GTGGCGTTCT GGCTCTCCGC GGTGATCGTG TTCGTGGGCT ATTACATCCG CACCAACGTC
TCCGAGTCGC CCATCTTCCT CGAAGCGAAA GCCCGGTTGG AGCAGGAGCA GGCCGTCAGC
TTCGGCGTCG GCGAGGTCAT CCGCAAGTAC CCCAAGGGCA TCCTGCAGGC CATGGGCCTC
CGGTTCGCGG AAAACATCAT GTACTACCTC GTGGTCAGCT TCGCGATCGT GTACCTCAAG
AGCGTCCACA AATACGACAC GTCCTCGCTC CTGCTGGCAC TGCTGATCGC CCACCTCATC
CACTTCCTGG TCATCCCGCA GGTGGGACGG CTCGTGGACA GCTGGGGGCG GAAGCCCGTG
TACCTGGTGG GCGCCATCAC CGGTGCCACC TGGCCGTTCT TCGCTTTCCC CATGTTCGAC
ACCAAGAACG CCGTGGTCAT CGTCCTGGCA GTGACCATCG GCCTGTGCCT GCACGCGTTC
ATGTACGCGG GCCAGCCGGC CCTCATGGCG GAGCTCTTCC CCACCCGGAT GCGCTACGCA
GGTGTGTCGC TGGGCTCGCA GGTCACCTCG ATCTTCGCCG GTTCGCTGGC GCCGCTCCTG
GCCACGCAGT GGCTCAAGGA CACCGGATCG TGGGTCCCCA CCGCCATCTA CCTGGTGGTG
GCGTGTGCCA TCACCACGGT GGCAGTGTTG AGCCTCAGGG AAACCAAGGG CATTGCCCTC
GAGGAAGTTG ACCGGGCCGA CGCCGAGCGC GAAGGCCTGG CGGTAGCAGC CGCACGTTGA
 
Protein sequence
MSVEQRSATP AASAPKGAGL KKIVAASMVG TVVEWYEFFL YATAATLVFG KYFFPATGNE 
LDGIIQAFIT YAVGFVARPL GGIVFGQIGD KLGRKPTLQL TIVIIGVSTF LMGCLPGFAE
IGYLAPALLV FLRFIQGFAL GGEWGGAVLL VAEHSPNESR GFWSSWPQSA VPVGNLLATL
VLFIMSNVLS SADFLSWGWR VAFWLSAVIV FVGYYIRTNV SESPIFLEAK ARLEQEQAVS
FGVGEVIRKY PKGILQAMGL RFAENIMYYL VVSFAIVYLK SVHKYDTSSL LLALLIAHLI
HFLVIPQVGR LVDSWGRKPV YLVGAITGAT WPFFAFPMFD TKNAVVIVLA VTIGLCLHAF
MYAGQPALMA ELFPTRMRYA GVSLGSQVTS IFAGSLAPLL ATQWLKDTGS WVPTAIYLVV
ACAITTVAVL SLRETKGIAL EEVDRADAER EGLAVAAAR