Gene Cmaq_0759 details

Gene Information

Locus tag: Cmaq_0759
Symbol:
ID: 5708969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 789603
End bp: 790814
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 44%
IMG OID: 641275261
Product: major facilitator transporter
Protein accession: YP_001540585
Protein GI: 159041333
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
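
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 789603
end_bp = 790814

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1212 403
```

Under these assumptions the computed values agree with the listed Gene Length (1212 bp) and Protein Length (403 aa).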


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAATT TAAGGGAGAT TGATTATGGG CCATTAGGGA GGAGTAGGTT CTTCGCCATA 
ACATCAACGG CACTGGGCAT GTTCCTATGG GGTTTCATAC TGGCATTAGC CCCATTAACA
ACGCAATGGC CCTTCGTACC AAGTAGTTTT GATGAGTACG TACTCTTAGC GGCGCCAGTA
TCATTAATGG TTGGTAATTT AATCATCGGG AAATTATCTG ACGCCTTGGG TAGGAAGATG
GGTTTCATAA TGACCATGGT TATTTACGGT GTTGGTGCAT TACTAATAAT ATTATCCAGT
AACGTTTACG AATTAATAGC CGGCATAGCT CTAGCTGAAT TTGGTCTTGG TGGTGAGGAG
CCTGCAACGT TAGCCTATAT GGCTGAAATG ATGCCCATTA GGTTGAGGGA GAGGATCATC
ATAGGTGTTA CTAATGTGGC TAATATTGGT GCCGCGGTGG CGGCTGCATT GTCCCTTGTT
GCTTCATCAA TGTACCTTCA GAAGCTGTTC TTTGGTGTAA CAATACTAGT CACCATAATT
ATAATATTAG CCACTAGGCT GCTGATACCT GAATCATATA GGTGGAGTAA CGTGAAGTCG
TCATGTAGGG TGATTGGGTT AAGTGAGATG AAGGGATTTA GGATTAGGTT AATATTCTTA
ACACTCATAG CCTTAACAAT AGTGTTAACA TATGCATTAC TGGCTCTCGT AATGGGTCCT
TACCTCTTTC CGAAATTAAC CTCATGGATC GTGCTACTGT ATAATGTTGG TGAATCAGTG
GGTGGCTTAA TAGGCATGGC CTTAGCTGAT ATAATGGGTA TGAGGAAGTT CACCCTAATG
GCCTACCTAG GTGGTTTCAT CACAATGCTA CTGGTTATAC CTCAATTAGC CTTTGCACCG
CAAAACCTAC CACTATTCCT TCTAATACTA TCTGTTAACG GCGTATTTGG TGAACTGGGG
TGGGCCGCTA GGGTTGTACT GGAGCCTGAG TTATTTCCAA CAGGCAGTAG ATCCATGGGT
GTTGCCTTGG TTAGGGCAAT AGCCTACGTC GCCTACATAG CATCAATCTT CATAACAGCT
GGTTTCACAA TATGGCAATA CGCATGGTAT AATGTCGCAC TATGGAGCCT TGGGCTATTA
GGTGCATTGA TTTGGTTTAC CCATGGTGTT GAAACCAGGC TTAAGACCCT GGAGGAGGTT
AATCCGCAGT GA
 
Protein sequence
MSNLREIDYG PLGRSRFFAI TSTALGMFLW GFILALAPLT TQWPFVPSSF DEYVLLAAPV 
SLMVGNLIIG KLSDALGRKM GFIMTMVIYG VGALLIILSS NVYELIAGIA LAEFGLGGEE
PATLAYMAEM MPIRLRERII IGVTNVANIG AAVAAALSLV ASSMYLQKLF FGVTILVTII
IILATRLLIP ESYRWSNVKS SCRVIGLSEM KGFRIRLIFL TLIALTIVLT YALLALVMGP
YLFPKLTSWI VLLYNVGESV GGLIGMALAD IMGMRKFTLM AYLGGFITML LVIPQLAFAP
QNLPLFLLIL SVNGVFGELG WAARVVLEPE LFPTGSRSMG VALVRAIAYV AYIASIFITA
GFTIWQYAWY NVALWSLGLL GALIWFTHGV ETRLKTLEEV NPQ
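
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record. It assumes that translation table 11 uses the same amino acid assignments as the standard genetic code (differing only in permitted start codons). The `translate` helper and the constant names are hypothetical.

```python
# Codon assignments in NCBI order (bases T, C, A, G at each position).
# Assumption: translation table 11 shares these amino acid assignments
# with the standard code and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        b1, b2, b3 = (BASES.index(c) for c in seq[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the listing.
prefix = "ATGAGTAATT TAAGGGAGAT TGATTATGGG"
print(translate(prefix))  # MSNLREIDYG
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would allow the whole 403-residue product to be compared.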