Gene Arth_1721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1721 
Symbol 
ID4445760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1922603 
End bp1923901 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content66% 
IMG OID639689543 
Productmajor facilitator transporter 
Protein accessionYP_831215 
Protein GI116670282 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.309946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGACA CAAAGTCGAC GGCGGACGGG GCCTGGCTAG CTGTCCTCGC TGCAGGCATC 
GCGGCGGCGA TGCACATCTG GAAACTTCCG GCTGCCCTGG CCGGCATCCA GGCGGATCTC
GGCACCTCAC TGATCCAGGC CGGACTGCTG CTCGGCATCA TCCAGTTGGC CAGCGTTGTG
GGCGGCCTCG CCACAGCCGT GGGCGGTGAA ATGGCCGGCC TGCGCCGGCT CCTTATCTCC
GGCCTGGTCC TGCTCAGCGC TGCCTCCATC CTGGGCGCAG CCGCCCCCAG CACCGAATGG
CTGATGGCTG CACGCACGCT TGAGGGTGTC GGCTTCCTGC TGGCCGTCGT CGCCGCCCCT
GCCCTCATCC GCAAAGTGGT CCCGCCCCAG CGGTTGAACG TTGCACTGGC AAGCTGGGCC
ACGTTCCAGG GAACGGCCAC GCTGATTGGC CTGTCCTCCG GTGCCTTGTT CCTGCAGGCC
GTCGGCTGGC GCGAATGGTG GCTGGTCATG GCCATTCTCA CGCTTGTCCC CGTGCCGCTG
CTGCTGCGCC GGGTGCCGAG GGACGTGGCC CCCGCCGACG CCGGGCTGCG CTCCGCACTC
CGCAGGGTGG GCCGCACCGT TGCCACCCGC AGGCCATGGA TCATAGGTCT GGTCTTCGCC
TGCTATACGG CGCAGTGGAT GGCGGTGCTC GGCTTCCTGC CCTCAATTTA CCGGTCAGCG
GGGCTCGGCG GTCCGTGGCC GGGCATCCTC AGCGCCATGG TCGGAGGAGT AAACGCCATC
GGGGCCCTCA GCGCCGGTCC GCTGATCCAG CGTGGCCTGT CGGAGCGGAG AATTATCTTT
TGGACCTTCC TGTCGATGTC CGCCGCTTCC GTGGCCACGT TTGCCATCGG CTGGGAAAGT
CTCGCGAACG GTGTCCTCGT CCAGGTCGTC TTCATAGCCC TGTTTTCAGC TGTGGGGGGC
CTCATCCCTG CCGCGGTGAC CAGCTACTCG GTGCGCATCG CACCCGCCGA CGGTTCTGTC
ACCGCCGTGC TGGGGCTGAC GCAGCAGATC TTTAACGTGG GAAACTTTCT GGGGCCCATG
TTGTTTGCCC TGCTCGCCAC CACCACCGGC GGCTGGGGCA CCACTTGGTG GCTAACATGT
GGGTTGAGCT CTCTGGGGAT GGCTCTGCTG GTATTCCTGG GACGGGCCGA CGGCGCGAGC
ATCGGCGGTG TGCGACGCCG CAAAAGTGGC TTTCCGCCCA ACGGAAAGCT CAAATGTAAG
GCCGGTCACA CAAGCCTAGA GTCGAGGAAA CGCCGGTGA
 
Protein sequence
MDDTKSTADG AWLAVLAAGI AAAMHIWKLP AALAGIQADL GTSLIQAGLL LGIIQLASVV 
GGLATAVGGE MAGLRRLLIS GLVLLSAASI LGAAAPSTEW LMAARTLEGV GFLLAVVAAP
ALIRKVVPPQ RLNVALASWA TFQGTATLIG LSSGALFLQA VGWREWWLVM AILTLVPVPL
LLRRVPRDVA PADAGLRSAL RRVGRTVATR RPWIIGLVFA CYTAQWMAVL GFLPSIYRSA
GLGGPWPGIL SAMVGGVNAI GALSAGPLIQ RGLSERRIIF WTFLSMSAAS VATFAIGWES
LANGVLVQVV FIALFSAVGG LIPAAVTSYS VRIAPADGSV TAVLGLTQQI FNVGNFLGPM
LFALLATTTG GWGTTWWLTC GLSSLGMALL VFLGRADGAS IGGVRRRKSG FPPNGKLKCK
AGHTSLESRK RR