Gene Cmaq_0821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0821 
Symbol 
ID5708764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp861512 
End bp862723 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content40% 
IMG OID641275324 
Productmajor facilitator transporter 
Protein accessionYP_001540646 
Protein GI159041394 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAATA AGGATACATA CATTGCCATA CTGGCTGGAA TGGGTGGTTT CACGGATGGT 
TTCGCATTAC TACTGGGTGG TGCAGCATTA CTATCATTAA GTCATTACTT TAAATTAACC
CCTGGGATTG AGGGATTAAT AATATCAATG CCGTTCATAG GTTCAGTTAT TGGTTCATTA
ATCTTCGGTA GATTAGCCGA TTTATTAGGT AGGAGGGCAA TATTCCTAAA CGTCCTACTC
TTCTTCGTCT TAGGTTCACT TATAAGCGCC GTAGCCTACA ATACGCTACT CATAATTATT
GGTAGATTAC TAGTAGGTAT TGGAATAGGG GGTGATATAC CAGCTGGCGC CAGTCTAGTG
GCTGAATTAT CTAATAAGGG TAGTAGAGGT AGGTTAATTT CAATACAAAG CATGTTATGG
GGACTCGGTG GTTTTGCGGC GGCGTTGGTT GCATTACCAT TACTGAACCT AATGGGTGCT
GAATCATGGA GGTTAATATT CGGGCTAGGT GCAATACCAC CTTTAATAGT GTTAGTCTTA
AGAAGAAGCA TTAATGAGAG TTGGGTATGG GAAATCAAGA GGAGAAATAG CAGTAATATT
AAGTTAAGTA ATCGTAGTAG GTATACATTA GTCTTAGCCT TCACCGCAGC GTCGTTATTT
GTATGGACAT TTATCTTAGC CATATTCGCC AATTATACAC CCACCATATT AGTTAATGCA
ATGCAGCTTA ATAAGGCTAT GGCATTACTC ATAAGTGGAC TTCAGTGGAT TGGCTTCGTA
GCTGGTGGAT TAATCGTATT CAAGTACTGT GATAGGCTTG GTAGAAGAAC ACTCATAGTA
CCTGCGACAG TGGGTGAGGC TGCATTACTT TACTTATCAA TAATAATGGT CAAGGATCCA
GTGACACTAT CACTAATACT AATAACTCTA TGGGTACTGG GGGGAGTCGG CTACATTGTG
AACAGCATAT ATTCAGCGGA ATTATTCCCA ACACTACTCA GGGGTACATC AAACGGTATA
AGTTTCTCAG CAGGGAGACT GGGTGGGTAT ATAAGTACGC TTATATTACC ATCACTGCTG
TTGAGTATTG GATTAAGTAA AGTATTCTTA ATCCTTGCAG TCATGATAAC CCCATTAATT
GCAACCTGCA TAGCCACTGC ACCACGTAGT GAGAATAAGA GCTTAGAGGA ATTAGAAAAA
GACTACTTCT AA
 
Protein sequence
MVNKDTYIAI LAGMGGFTDG FALLLGGAAL LSLSHYFKLT PGIEGLIISM PFIGSVIGSL 
IFGRLADLLG RRAIFLNVLL FFVLGSLISA VAYNTLLIII GRLLVGIGIG GDIPAGASLV
AELSNKGSRG RLISIQSMLW GLGGFAAALV ALPLLNLMGA ESWRLIFGLG AIPPLIVLVL
RRSINESWVW EIKRRNSSNI KLSNRSRYTL VLAFTAASLF VWTFILAIFA NYTPTILVNA
MQLNKAMALL ISGLQWIGFV AGGLIVFKYC DRLGRRTLIV PATVGEAALL YLSIIMVKDP
VTLSLILITL WVLGGVGYIV NSIYSAELFP TLLRGTSNGI SFSAGRLGGY ISTLILPSLL
LSIGLSKVFL ILAVMITPLI ATCIATAPRS ENKSLEELEK DYF