Gene Cmaq_1395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1395 
Symbol 
ID5709453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1473398 
End bp1474714 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content43% 
IMG OID641275906 
Productmajor facilitator transporter 
Protein accessionYP_001541211 
Protein GI159041959 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.351043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTATCAA TGAGAACCAA GGCAATAATT AGTACAACCC TTGGTATCGC CTTTGAGTGG 
TATGACTTCT TCCTATACAG TTTACTAGCC CCGGTGATAG CGCAAGTGTT TTTCCCAAAG
ACTATACCTG TTTTATCACT GGCTTATGCC TATGTTGTAC TCTTCATAGG ATTCGTGGGT
AGGCCGGCTG GGGGCTTGAT CTTTGGTTAT ATTGGTGATA AATTTAGTAG GATACGTGCA
CTATACTTCA CATTGCTCAT CGCAGGGATA TCAGTATTAT TAGTTGCAAT ATTACCTACG
TATCAACAGA TTGGTGTAGC GGCACCAATA TTATTAGCAA TACTGAGATT TGCTGATGGC
ATAGGGCTTG GTGGCGAATG GGGTGGTAGT TTCTCCCTAA CCTCAGAGTA CATAAATCCA
AATCTAAGGG GCTTTTTCTC AGGTCTTCTC CAGGCTACCG TACCCGTGGC ATCTTTATTA
GTAAGTGGAT TCACACTATT ATTCACCTCA CTGCTTGGTG AAAGCGGCTT CTACGCTGTT
GGCTGGAGGT ATGTCTTCGC AATAGGCTTC ATCATATCAA TAATCGGCGT CTTCATAAGA
TTTAGGGTTG CTGATTCCCC AGTTTTTCAA AAACTCGTGG AGACGGGCAG GGTGGTTAAA
AACCCAATCT CCGGGGCGTT CAGGAGGTAT TGGAAATTAA TCCTAATGGG TTTATTCCTA
GTAGGCATAG TAAATGGGGC TTATTACTAC CTAAACTTCG CCTTTGCACT GGGTTACGCA
ACAACCATTG CTAAGGCATT TCATAAACCC TACGTACCCT ACTCCGTGGT CTCAGAGGGA
GTATTAATAT CCTCCCCAGT ATTGATAATA CTTGCACTAG CCTTCGGCTA TCTATCAGAT
AGGATTGGTA GAAGACCCTT AATATTAGCG AATGCAGTAG GCGCAATTGT TTTCATAGCG
CCATATTTAC TAATGCTACT AAGCGGCGAC CCCACGCTTG TTATGAGCGC AATAGTGCTT
GGTGGATTAA TTTTCTGGTT GATTTCAGGT GCCATAACGC CCATAGTACT TGTTGAAATG
TTCCCACCTG AGGTTAGGTA TACTGGTATT TCCACTGCTT ATCAAATCGG TGTGGGATTC
ATAGGTGGTT TATCACCATA CATACTAACA TTCATGATAT CAGCATTACA TGATATCTTC
TGGCCACCAC TTATCTATAC AGTGGTCCTG GGATTAATAG TCCTATTCAT AGGCATAGTA
CTGGGTGAAA CCAAGGGAAG ACTACACGTG GGTGAGGAAA TCCTAAGACA GCAGTGA
 
Protein sequence
MVSMRTKAII STTLGIAFEW YDFFLYSLLA PVIAQVFFPK TIPVLSLAYA YVVLFIGFVG 
RPAGGLIFGY IGDKFSRIRA LYFTLLIAGI SVLLVAILPT YQQIGVAAPI LLAILRFADG
IGLGGEWGGS FSLTSEYINP NLRGFFSGLL QATVPVASLL VSGFTLLFTS LLGESGFYAV
GWRYVFAIGF IISIIGVFIR FRVADSPVFQ KLVETGRVVK NPISGAFRRY WKLILMGLFL
VGIVNGAYYY LNFAFALGYA TTIAKAFHKP YVPYSVVSEG VLISSPVLII LALAFGYLSD
RIGRRPLILA NAVGAIVFIA PYLLMLLSGD PTLVMSAIVL GGLIFWLISG AITPIVLVEM
FPPEVRYTGI STAYQIGVGF IGGLSPYILT FMISALHDIF WPPLIYTVVL GLIVLFIGIV
LGETKGRLHV GEEILRQQ