Gene Cmaq_1542 details

Gene Information

Locus tag: Cmaq_1542
Symbol:
ID: 5709958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1623140
End bp: 1624420
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 42%
IMG OID: 641276050
Product: major facilitator transporter
Protein accession: YP_001541355
Protein GI: 159042103
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
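
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below works through that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1623140, 1624420)
print(length_bp, protein_length(length_bp))  # prints: 1281 426
```

Both values agree with the Gene Length and Protein Length fields of this record.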


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.141245
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGGTCAG GATACAGTAA TATACAGGTA GTGTTTAATT GGCGTAATGT AATCGGTTCC 
ATGCTTGGTT GGGGAATGGA TGCTTACGAC TTTGTTGCAT ACGATTTTGT GGCACCGGTT
ATAAAGGATT TATTCTTCGG TCAATTAGGC AGCTTAGGAT CCATGCTAGC GACACTGGCA
GTATTCAGCT TATCTCTAGT CGTTAGGCCA CTTGGTGGCG TATTCTTCGG TAATTTATCT
GATAAAATAG GTAGGCGTTA TGTACTTTAC GTAACCATGT TAGGTGCAGG AATATCCAGT
TTCTTAATGG GCTTTCTACC AACTTATGCT CAAGCCGGGT TAACCGCAAT AATACTTCTT
CTCCTTTTAA GGTTCGCGGT GGGCTTTTTC CTTGGTGGGG AATACTCATC AAGCGGTATT
ATGAGCGTGG AGAGTGTTAC TAAGTGGAGG GGGCTCGCCA GTGGCATTAT GCAGGCTGGT
TTTGATATAG GTATCTTCGG GGTCACCTTT ACATACACTA TTGTTGCAAC ATACCTACCC
AGTAGCGCAA TGAGCACCAT AGGTTGGAGG ATCGTATTCT GGAGCGGCAT AGTAACAACC
ATATTAGCCT TCATATTTAG AAGAAGGTTC CTAACGGAAT CCCCTGAATG GGAGAAGGCG
GAGAAGGTTA AGAGTCCATA TAAGGTTCTA TTTAAGAATT ATTGGTTACC AATGTTAACC
ATTCTTCTAG CAACAATGGG ATTCCTCTAC GAGTACTATG TTATGCTTGA GTTATCACCA
TTTATACTTC AGAGCGTTCT AAGTTACTCA GCTGCATTAT CCGGGTTAAT ACTAACAGTG
CTTTCAGCTA GTGACGCAGC AGGTAGTGTG TTTGGTGGTG TGCTTGTGGA TTTACTGAGA
AGCTCCAGGA GAGCCTTATT AGTAACATCA TTAATAATCC TAGTACTAAT ATATCCAACA
ATGTATGGGG TATTAATTAA TGGTAATGGC TGGCTTGTAC TACTCTGGGA TTTCCTAGCC
GTGCTACCAG TCGGGGTTAT GCAGGTTTAT ATTAGGGATC TTTTACCAGT TAACGTTAGA
TCCACAGGGG CTGGTTTAGG TTATAATGGT GGAACCTGGT TAGCTGCCTG GGCAGCATTA
ATAGCTACAT TAATGGCCTC AGCATCAACT AAGCCTCAAC CATGGTTATC ATCAATAACA
ATTAATACAT TAATAGGAGC AGTATTAATG ATAATTAGCT TCATCCTAGC GGCTAAGGCA
ATTAAATCAC ATTCACAATG A
 
Protein sequence
MGSGYSNIQV VFNWRNVIGS MLGWGMDAYD FVAYDFVAPV IKDLFFGQLG SLGSMLATLA 
VFSLSLVVRP LGGVFFGNLS DKIGRRYVLY VTMLGAGISS FLMGFLPTYA QAGLTAIILL
LLLRFAVGFF LGGEYSSSGI MSVESVTKWR GLASGIMQAG FDIGIFGVTF TYTIVATYLP
SSAMSTIGWR IVFWSGIVTT ILAFIFRRRF LTESPEWEKA EKVKSPYKVL FKNYWLPMLT
ILLATMGFLY EYYVMLELSP FILQSVLSYS AALSGLILTV LSASDAAGSV FGGVLVDLLR
SSRRALLVTS LIILVLIYPT MYGVLINGNG WLVLLWDFLA VLPVGVMQVY IRDLLPVNVR
STGAGLGYNG GTWLAAWAAL IATLMASAST KPQPWLSSIT INTLIGAVLM IISFILAAKA
IKSHSQ
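
The protein sequence can be rederived from the gene sequence with translation table 11. The following is a minimal sketch, not the database's own pipeline. It assumes the input is a complete CDS that begins at its initiation codon, so the first codon is reported as M even when it is an alternative start such as GTG. Translation stops at the first in-frame stop codon. The names `translate_cds` and `gc_percent` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the initiator codon is always reported as M."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # assumption: sequence starts at the start codon
    return "".join(residues).split("*")[0]  # cut at the first stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGGGTCAG GATACAGTAA TATACAGGTA"))  # prints: MGSGYSNIQV
```

Passing the full gene sequence to `translate_cds` and `gc_percent` gives values that can be compared against the Protein sequence block and the GC content field.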