Gene Cmaq_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1968 
Symbol 
ID5708442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp2044154 
End bp2045629 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content47% 
IMG OID641276478 
Productmajor facilitator transporter 
Protein accessionYP_001541774 
Protein GI159042522 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.632276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAATTA ATAATGCGGA GTATGACCTT AAGTACGCCT ATAGGGCAAT GGTCATACTT 
GCCTCATTAG CGGTAATAGT AATGTATATA GAAGGCATGT TAATACCGTC ATTAACTGAA
ATAGAGAGGG AGTTCGGGGT AACCTCAAGT CAAGTTAGTT GGGTCCTCTC ATCATACCTA
CTCTCCGGTA CTGTTCTACT ACCAATAGTG GGTAGGCTTG GTGACATTTA CGGTAAGAAG
AGGGTTCTCT CAGCAGTGGT CATAATATAC GCTGTGGCAG TCACATTAAC CAGTGTATCA
CCCAGTTTCA CGTACTTAAT AGCCTTCAGG GGTATTCAAG GTATTGGAGT AACCATGTTC
GCATTAGCCT TCAGTCTAAT CAGGGAGGAG TTTCCAAGGG AATTAATACC AAGGGCTCAG
GGATTGGTGA GTGCAGCCTT TGGAATTGGT GCGGCAATAG CACTACCCCT GGGCGCATAC
ATAAGCCAGT ACTTTGGTTG GAGAACAACA TACCACACAG CCATACCCTT CGTACTGCTG
GTGGCGTACC TAATAGTGAC TAGGATAAAG GAGTCAAGGT ACAGGAACCC TAGTGCTAAG
GTTGATTTAC CTGGGGCAGC GGTACTTGGA ATTGGATTAG GCCTGGTGGT TTACGGATTA
ACCGAGGCAC CCATATGGGG TTGGACTAAC CCGAACACGA TAATAACCTT CCTAGCGGCC
CTCATATTCA TAGGAGCCTT CATAGCCGTA GAGAGGAGGA GGGAGCAGCC GTTAATTAAC
CTATCATTAT TAACTAGGAG AAACGTGTTA ATAGCTAATC TAGCCGCAAT GGTGGCTGGC
TTCGGCCTCT TCCTATTTGA ACAAAGCCTA ATAATACTCC TCGAGGAGCC TAAGCCCGTT
GGCTTCAACC TATCCATATT CGATACCGGC TTATACGCAA TCCCCATGGC TGTGGCGCAG
TTAATAGTCG CCCCAGTTGC AGGCATATTA ATAACTAGGA TAGGGGCTAG GAGAATGCTC
ATGACTGGGG CAAGTATAGC CGCCTTATTC AGCCTAATAA CCGCCGCCGT GGCCCCCCTG
GGTTTAGGGG CTTTGATAAC ATCAACAACA TTAGCCATGG CTGGGGTAGC GGCAATGAAT
GTATCCCTCA TTAATATCCT TGTTTTCTCA GTGGAACCGC AGGTAATGGG GGTTTCAACA
GCAATGAACT CAGTCTTCAG GAACCTGGGT GGTACCCTAG GCCCAGCGGT GGCTGGTTCA
CTTGAGTCAA CATTCACATC ACTGGTTCTA ATGGGTATAC TGCCGGGGCG TAATGTGCCG
CTCTTAGTTA CAGTGCCATC AATGTACGCC TTCCAGATTG GTGCAGTAAT CTCAGCCTTA
ACAGTGGTAA CGATAGGTAT ATTGGCTTAC TTCTCAGTGG AGGTCATAAC CTGGAGAAAT
GAATCCCAGA CTGTCGCCTC ATTAAGCCAG GAGTAG
 
Protein sequence
MLINNAEYDL KYAYRAMVIL ASLAVIVMYI EGMLIPSLTE IEREFGVTSS QVSWVLSSYL 
LSGTVLLPIV GRLGDIYGKK RVLSAVVIIY AVAVTLTSVS PSFTYLIAFR GIQGIGVTMF
ALAFSLIREE FPRELIPRAQ GLVSAAFGIG AAIALPLGAY ISQYFGWRTT YHTAIPFVLL
VAYLIVTRIK ESRYRNPSAK VDLPGAAVLG IGLGLVVYGL TEAPIWGWTN PNTIITFLAA
LIFIGAFIAV ERRREQPLIN LSLLTRRNVL IANLAAMVAG FGLFLFEQSL IILLEEPKPV
GFNLSIFDTG LYAIPMAVAQ LIVAPVAGIL ITRIGARRML MTGASIAALF SLITAAVAPL
GLGALITSTT LAMAGVAAMN VSLINILVFS VEPQVMGVST AMNSVFRNLG GTLGPAVAGS
LESTFTSLVL MGILPGRNVP LLVTVPSMYA FQIGAVISAL TVVTIGILAY FSVEVITWRN
ESQTVASLSQ E