Gene Cmaq_0387 details


Gene Information

Locus tag: Cmaq_0387
Symbol:
ID: 5708627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 422514
End bp: 423917
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 45%
IMG OID: 641274890
Product: major facilitator transporter
Protein accession: YP_001540223
Protein GI: 159040971
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
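
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive (an assumption, though it is the reading that reproduces the listed 1404 bp) and that the final codon is a stop codon contributing no residue. The variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 422514
end_bp = 423917

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1404 0 467
```

The result agrees with the listed gene length of 1404 bp and protein length of 467 aa.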


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.285838
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTCTA TAAGAGTCAT GAGGGTGCTC ATATTATTCA CCACTTCATT AGCCGCGTTT 
CAAACCCCCT TCAATTCAAC AGTCCTATCC TTCATAGTTC CTGTACTTGG TAAATACTTC
CACGCCTCAT TGTACACGCT GGTTTACGTG CCTGTGGTTT ACTTAATACC ATTACCGACA
TTAATGGTGC TACTGGGTAG GATTGCTGAT ATTTACGGTA GAGAGAGAGT CTTCAGGATT
GGTTTCGCAT TATTCATAGT GGGTTCACTT ATGGGTGCTT TTTCACCAAG CATCTATGTT
TTAATAGCAT CATCATTAGT GATGGGGCTT GGGTCATCAA TACTATCACC AAGCTCCACA
GCCATAGTTA GCCAAGTCTT CCCAGAGGGT GAGAGGGGGT TTGCCTTAGG TATTAACGCA
ATGGCCGTCT ACATGGGCTT AACCTCAGCA CCATTCCTAG GTGGGTTAAT TACCCAATTC
CTCGGCTGGA GATTCGTATT ACTGGTTACC ACATTACTCT CAGTAATTGG CTTAGCGGTA
TCATTCGTAT CCATGAGGGG TATTGACTTA CCTAGACGCG GCATCCCCAT TGATGCAGCT
GGCGCAGCCT CATTCTCAAT AGCCCTCCTC TCAATAGTAA TATTCATGAT ACTGGCGGCC
ACGGGTGATT GGTTAAATTA CCTTTACCTA CCAGTAATTA GTGCGGCTTC ATTTGCTTTA
TTCATAGTGA TCGAGGGGAG GGTTAAGGAT CCTATGCTTA ACTTAAGCTT ATTCACCCGT
AACATATCAT TCATGGCTGG TAACGTGACT GCTTTACTAA ACTACATAAG CACGTACTCG
GTACCATTCC TGTTCTCACT CTACCTACAG TCAATACTCG GCTACACACC CTTTGAGGCA
GGCCTAATAC TAATCCCTGA ACCAGTATTC ATGGTAATAC TCTCACCCAT TAGTGGTAGA
CTCTCCGATA TCTATGGTTC AAGGGAAGTG GCTGCATTGG GAATGGGGCT CATAGGCTTA
GCGTTCATAA TGCTACTTAT CCTTAACCTA AGGAGTGTAG TTAACGTGGT ACTGGCTTTA
TCGGTATTAG GCGTAGGCTT CGGCTTCTTC TCAGCACCCA ACACTAACTC AGTAATGGGC
TCAATAACAC GGGATAAGTA CGGTGTGGCA TCGGGGGTAT TGGGTACCAT GAGGTTCACC
GGCCAATTAC TAAGCATAAC CCTAGCCAGC GCGATACTGG CTAAGTACCT GGGTAAGTAC
ACTGCATTAT ACCTATTCAC TGGAGTACCA TTAATGAGCA CTATAGTGTA TGGTTTATTC
ACAGCGGGGT TGAGGATAAT GTTCATCATA GCTGCTGCAT TAAGCTTCAT AGGTGCATAC
ACGTCACTAC TTCGTGAAAG GTAA
 
Protein sequence
MESIRVMRVL ILFTTSLAAF QTPFNSTVLS FIVPVLGKYF HASLYTLVYV PVVYLIPLPT 
LMVLLGRIAD IYGRERVFRI GFALFIVGSL MGAFSPSIYV LIASSLVMGL GSSILSPSST
AIVSQVFPEG ERGFALGINA MAVYMGLTSA PFLGGLITQF LGWRFVLLVT TLLSVIGLAV
SFVSMRGIDL PRRGIPIDAA GAASFSIALL SIVIFMILAA TGDWLNYLYL PVISAASFAL
FIVIEGRVKD PMLNLSLFTR NISFMAGNVT ALLNYISTYS VPFLFSLYLQ SILGYTPFEA
GLILIPEPVF MVILSPISGR LSDIYGSREV AALGMGLIGL AFIMLLILNL RSVVNVVLAL
SVLGVGFGFF SAPNTNSVMG SITRDKYGVA SGVLGTMRFT GQLLSITLAS AILAKYLGKY
TALYLFTGVP LMSTIVYGLF TAGLRIMFII AAALSFIGAY TSLLRER
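
The gene and protein sequences are printed in blocks of ten characters, so the spaces and line breaks need to be removed before any computation. The following is a minimal sketch, not the database's own method, of how the GC fraction and the translation could be reproduced from the listing. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon-to-amino-acid assignments used are those of the standard genetic code, which translation table 11 is understood to share (the two tables differ in which start codons they permit); treat that as an assumption to confirm against the NCBI table if it matters.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last lines of the gene sequence as listed in the record.
first_line = "ATGGAGTCTA TAAGAGTCAT GAGGGTGCTC ATATTATTCA CCACTTCATT AGCCGCGTTT"
last_line = "ACGTCACTAC TTCGTGAAAG GTAA"

print(translate(first_line))  # MESIRVMRVLILFTTSLAAF
print(translate(last_line))   # TSLLRER
```

The two outputs match the first twenty and last seven residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content of 45%.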