Gene Cmaq_1609 details

Gene Information

Locus tag: Cmaq_1609
Symbol:
ID: 5708660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1680865
End bp: 1682097
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 45%
IMG OID: 641276117
Product: major facilitator transporter
Protein accession: YP_001541422
Protein GI: 159042170
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
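
The start and end coordinates, gene length, and protein length listed above can be checked against each other with simple arithmetic. The following is a minimal Python sketch, not part of the original record; the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the coding sequence ends in a single untranslated stop codon.

```python
# Values copied from the gene record.
start_bp = 1680865
end_bp = 1682097

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the Gene Length (1233 bp) and Protein Length (410 aa) fields.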


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.655483
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATA GAAGAGATTT GATACTTATT TTATCATCCA GGGTTCTTAG AAGTATTGCT 
ACGGGTGCTC TTGGTGTCAC CACGGGGCTT TACCTATATA ATGTGCTTCA CCTATCAGCC
ACCTTAATAG GCGTATTCTT TGGTGTTGGT GCCTTCGCAA CACCTTTAAT GAGCCTATAC
TTCGGTAGGC TTGGTGACTG TTATGGTAGG AAGAGGATGC TATTAATAGC ATTACTATTC
CTACCAGCTG CAACAGCAAT ACTACTATTA ACCTCAAACT ACGCCCTACT ACTAGTTGCA
GCGGCATTAG GTGGATTCGG TACTGCTGGG GCTTTAGCCA GTGGTAGTGT TGGGGCTATT
GTTGCCCCAA TGATGACTGC ATTATTGGCT GATAAGACTA ATGAGGAGAA TAGAACCATG
GTGTATTCAC TACTTAACCT AGCCTCAGGT CTAGCTGGGG CGGTGGGTGC ATTATTGGCT
CACTTAAGTT ACAGGGAGGG CTTCATGATT GCCCTAGGGC TTTCAACGGC ATCATTCCTA
GCAATACTGC CCGTTAGGGA TAGGTACAGT GAGAGTGCGG TTAAGGGTAG GTGCGGTTCA
GTTAACGGTA GGTTGGATGA GAAGGACAGG AGGGTTATTA GGAGGTTCGT ATTGACTGGT
GCCTTTAATG GTATTGGACA GGGCTTAGTG ACACCATTCC TACCCATAAT CTTTGAAATA
CTCCTCAAGA TACCTAAGGG TGAGATAGGG AATATCTTCT TCCTGGGTGG CGTGGCTGCT
GCATTAATAT CACTGCTAAC GCCAGTCATA ACCAGTAGGC TCGGCTTCGT TAGGACTGTT
GTTTTAACCA GGTCAATATC CACTGCGGCC TTAGTAACAT TACCCTTCGT AAATCGCTTT
TCCCCGGTGT TGAGTTATGA TGTGGCCATA GCCATGGTAG CCTACTTAAT TTACGTAATG
TTCAGGGTAG TATCCCTACC TGCTCAGTCA GCTTTAATGA TGAGTCTAGT TAGTCAGGGT
TCAAGATCAA CCACTGCCGG TGCTAATCAA GCGGCTAGGT TGCTTCCATC AGCGGCGGCG
ACATTATCCA GTGGAGCCAT GATTGATTAC GTAGCCTTAC CTGTCCCATT TATAATCGCT
GTGATTATTA ATGGAGTTAA CATATACCTG TATACAAGGT TCTTTAAAGA TGTTAGGACA
GGCCGTGGTG TAAGAAGTAT AATAGTTGAG TAA
 
Protein sequence
MSDRRDLILI LSSRVLRSIA TGALGVTTGL YLYNVLHLSA TLIGVFFGVG AFATPLMSLY 
FGRLGDCYGR KRMLLIALLF LPAATAILLL TSNYALLLVA AALGGFGTAG ALASGSVGAI
VAPMMTALLA DKTNEENRTM VYSLLNLASG LAGAVGALLA HLSYREGFMI ALGLSTASFL
AILPVRDRYS ESAVKGRCGS VNGRLDEKDR RVIRRFVLTG AFNGIGQGLV TPFLPIIFEI
LLKIPKGEIG NIFFLGGVAA ALISLLTPVI TSRLGFVRTV VLTRSISTAA LVTLPFVNRF
SPVLSYDVAI AMVAYLIYVM FRVVSLPAQS ALMMSLVSQG SRSTTAGANQ AARLLPSAAA
TLSSGAMIDY VALPVPFIIA VIINGVNIYL YTRFFKDVRT GRGVRSIIVE
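
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be analysed. The following is a minimal Python sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein sequence. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database tool. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to each codon and differs only in the permitted start codons.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
# Assumption: translation table 11 uses the same codon-to-amino-acid
# assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGTGATA GAAGAGATTT GATACTTATT TTATCATCCA GGGTTCTTAG AAGTATTGCT"
last_line = "GGCCGTGGTG TAAGAAGTAT AATAGTTGAG TAA"

print(translate(first_line))  # MSDRRDLILILSSRVLRSIA
print(translate(last_line))   # GRGVRSIIVE
```

The two translated fragments match the first twenty and last ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.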