Gene Arth_3998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3998 
Symbol 
ID4447261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4512863 
End bp4514515 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content66% 
IMG OID639691829 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_833473 
Protein GI116672540 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAAAA CCGCCGCCGG GCCCGTAGCC GGGGACGTCC TGACGAAGCG TCAGACCATC 
ACAGTGATGG TGGGTCTCAT GCTCGGCATG TTCCTGTCCT CGCTGGACCA GACCATCGTG
TCCACGTCGA TCTACACCAT CGCCAACGAC CTCGACGGGC TCTCCCTGCA GGCCTGGGCC
ACCACCGCGT ACCTCATCAC CTCCACGGTG AGCACACCGC TGTACGGCAA GCTCAGCGAC
ATCTTCGGAC GCCGGCCGCT CTACCTGACC GCCATCGTAA TTTTCCTGGC GGGTTCGCTG
TATGCCGGTT CGGTGCACTC CATGACCGAA CTGGCCATCG CACGCGGCAT CCAGGGCATG
GGCGCCGGCG GCCTGCTGGC CCTCGCGCTG ACCATCATCG GCGACATTGT GGCCCTCAAG
GACCGGGCCA AGTTCCAGGG CTACTTCATG TCCGTCTTCG GCATCTCCTC GGTCCTCGGC
CCCGTGGTGG GCGGCGCCTT CGCCGGGTCC GCGAACATCC TGGGCTTTGA CGGCTGGCGC
TGGGTGTTCT TCATCAACCT GCCCATCGGG CTGGCCGCAC TGGCCGTCGT CTTCCTCTAC
CTGCACCTGC CCGCCAAACA CGTCAAGCAG AAGATCGACT ACTGGGGCGC CGCGGCCATC
ACGCTGGCCA TCGTTCCGCT GCTCCTCGTA GCCGAACAGG GCCGCAGCTG GGGCTGGACC
TCGGCGGCGT CCCTCCTGTG CATCGGACTG GGCATGGTGG GCATCATCGC GTTCCTGCTG
GCCGAGAAAC GCGCCGGCGA TTACGCCCTG ATTCCGCTCC GGCTCTTCCG GAACCTCACG
TTCGGCCTGT CCTCATTGCT GAACTTCATC ATCGGCATCG GCATGTTCGG CGCCATCGCG
ATGCTCCCGA TGTACCTCCA GCTGGTCAAG GGCCTCACCC CCACCGAGGC CGGCCTGATG
ATGATCACCT TCACCGTGGG CATCCTCACC GGTTCCATCA CCGCCGGACG GACCATCTCG
GCGTCAGGCA CCTACCGGAT CTTCCCCATC ATGGGCACTG CCGTTCTCAC CGCGGCCGCC
ACCGTGATGG GCTTCTCGCT GGGCGTCGAC ACCGCGCTCT GGGTGCCGGG CCTCATCGCG
GTGTTCTTCG GCCTGGGCCT GGGCTTCTGC ATGCAGCCCC TCACGCTGGC CATGCAGGTG
TCCGTTCCTC CGAAGGACAT GGGCGTGGGC ACCTCCTCCG CGGCGTTCTT CCGGTCCATG
GGCGGCGCCG TGGGCACCGC GGTCTTCATT TCCATGCTGT TCAGCCTGGC CGCGGACCGC
ATCGCCACCG GCATGAAGGA TGCCATGCAG AACGCCGACT ACCTGAAGGT CCTGAAGGAC
CCCGCCGTGG CCGCCGACCC TGCCAACGCC AAGCTGTACG AGTTCTTCAA GAACGGCGCC
ACCAACGACT CCCTCAACGA CACCAGCTGG CTGCACACGG CCAACCCCGT GCTCACCCGC
CCCATCACCG AGGGCTTCGC ACAGTCGATC GACGCCGTGA TGCTCACTGC AGCCGTGCTG
ACCGGGGTTG CCTTCCTGAT CAGTTTCGCG CTGCCGAAGA AGAAGCTGAC GGACCCGAAG
ACGGCTTCAA AGGAGGCGGT GGCGGCCCAC TAA
 
Protein sequence
MSKTAAGPVA GDVLTKRQTI TVMVGLMLGM FLSSLDQTIV STSIYTIAND LDGLSLQAWA 
TTAYLITSTV STPLYGKLSD IFGRRPLYLT AIVIFLAGSL YAGSVHSMTE LAIARGIQGM
GAGGLLALAL TIIGDIVALK DRAKFQGYFM SVFGISSVLG PVVGGAFAGS ANILGFDGWR
WVFFINLPIG LAALAVVFLY LHLPAKHVKQ KIDYWGAAAI TLAIVPLLLV AEQGRSWGWT
SAASLLCIGL GMVGIIAFLL AEKRAGDYAL IPLRLFRNLT FGLSSLLNFI IGIGMFGAIA
MLPMYLQLVK GLTPTEAGLM MITFTVGILT GSITAGRTIS ASGTYRIFPI MGTAVLTAAA
TVMGFSLGVD TALWVPGLIA VFFGLGLGFC MQPLTLAMQV SVPPKDMGVG TSSAAFFRSM
GGAVGTAVFI SMLFSLAADR IATGMKDAMQ NADYLKVLKD PAVAADPANA KLYEFFKNGA
TNDSLNDTSW LHTANPVLTR PITEGFAQSI DAVMLTAAVL TGVAFLISFA LPKKKLTDPK
TASKEAVAAH