Gene Cmaq_0712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0712 
Symbol 
ID5709812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp748045 
End bp749436 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content47% 
IMG OID641275211 
Productgeneral substrate transporter 
Protein accessionYP_001540538 
Protein GI159041286 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0221085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000000199551 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGTCTA CTCATTATCC CAGTAGAGCA GCATATACCG TGGTCTCTGC CTTGGGGTAT 
ATGTTGGATG GATATGACCT AAGCGTTATC AGCGTTTTCA CTTTCTCCTT ACTCAAGTTT
GGGTTCTTCA AGTATAATAG TCTAGAGCTG GGCTTTGTTA GTGGTGCGGC ATTGCTAGGG
GCAATGTTCG GTGCATTGAT TTTTGGCCAC TTCTCCGATA GACTCGGTAG GAGGTATCTA
TATACCTTTG ATCTCTTATT CTTCGTAGTA TTTGCAGCGC TCAGCGCCTT CTCCACTAAC
ATTATTCAAA TGATAATATA CAGGTTCTTT GTGGGTTGGG GTGTAGGGGC TGATTACGCC
CTGAGCCCAG TTTACGCAAC TGAAATGTAT CCAACTAACA AGAGAGGAAT GGGTTATGGT
TGGGTGTGGA CGTTCTGGAG CGTGGGTGCC TTCATAGCAT TCATATTAGG CTATGTGTTT
TACTTAGTTG ATCCAGTATA CGGATGGAGG TGGGCCTTGG GCATAGGTGC AATAATAGCG
CTGGCCACGA TAATAGTGAG GTCACTTATG CCGGAGTCTA GCCGTTGGAA GGTGGCAGTT
AAGCAGGATA CGAATGCCGT GGAAGAGGCC CGTCGGTTAT CTCAAGTCAC CGGTATGAGT
GATCAAGACA TAAGTAAGCT AGTGGAGGTT GAAGCTAAGA AGCTAGCTCA TGTGAAGCCA
GGTTCGTTTT TGGAATTATT TAAGGGTGAT TACGCGAAGA GGACGGCAAT TGTCTGGACT
CAATGGATAC TATATGACAT TGGGTCGTAT GGTTTTGGCC TGTATGCCCC ATCCATAATA
TCGATGCTTG GCTTCAAGGG CGCATCCTCC ATGTTATTAT CAGCACTACT CTACATACCG
GGGGCCCTGG GTGCACTGGG TGCTGCATTC CTCAATGATA GGTGGGGGAG AAGGATTCTT
CAATTACTTG GTTTTGGTTT CTCCACGCTT GGAATGGTAT TAGTAGCCTT AGGGGCATTA
ATAGGTGGGC TTATGGCCAT GGTTATTGGT GTCATTGGAT TAGTGTTGTG GTATGGCTTC
GGTAATTTGG GTCCGGGAAA CACCATGGGT CTGTATGCTA TAGAGCTTTT CCCAACTAAG
CTTAGGTCAA CATCGATGGG TAGTGCCACG GCCATAACCA GATTCGTATC GTTCTTGAGT
GCCTTCGAGT TCCCATACAT AGCCCTAGTC TTCGGTAAAT TATCATTCTT CGAGTTCCTG
GCAGCGATTA CCTTCGTGGC CTTCATATTT ACAATTTTCT TCACACCGGA GACTAAGGGA
ATATCGCTGG AAGACATAGC CACAGCCAAG TACAAGGGCC CTGGACTACA TCCCAGACTA
GAGGTGGAGT AG
 
Protein sequence
MKSTHYPSRA AYTVVSALGY MLDGYDLSVI SVFTFSLLKF GFFKYNSLEL GFVSGAALLG 
AMFGALIFGH FSDRLGRRYL YTFDLLFFVV FAALSAFSTN IIQMIIYRFF VGWGVGADYA
LSPVYATEMY PTNKRGMGYG WVWTFWSVGA FIAFILGYVF YLVDPVYGWR WALGIGAIIA
LATIIVRSLM PESSRWKVAV KQDTNAVEEA RRLSQVTGMS DQDISKLVEV EAKKLAHVKP
GSFLELFKGD YAKRTAIVWT QWILYDIGSY GFGLYAPSII SMLGFKGASS MLLSALLYIP
GALGALGAAF LNDRWGRRIL QLLGFGFSTL GMVLVALGAL IGGLMAMVIG VIGLVLWYGF
GNLGPGNTMG LYAIELFPTK LRSTSMGSAT AITRFVSFLS AFEFPYIALV FGKLSFFEFL
AAITFVAFIF TIFFTPETKG ISLEDIATAK YKGPGLHPRL EVE