Gene Cmaq_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0996 
Symbol 
ID5710468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1048869 
End bp1050038 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content40% 
IMG OID641275497 
Productmajor facilitator transporter 
Protein accessionYP_001540817 
Protein GI159041565 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.825084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.507459 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCAA CAATAAATAA GATTGCGCTA CTTAATGGGA TGAGGACGCT TGGTTTCTCC 
CTCATCTGGC CGTACATAGG CTTAATATTG TATAAGTTGG GTTTACCATT CTGGTTAATC
GGCATTTACT ATAGTGCTCA GATGCTGGTT AACATTGTTT CACAGGTAAT TGGTGGTGTT
TTAACAGATA TCATGGGTAG GGTTAAATTA ATGCTTATTG GTAACATTGG GTCAAGTATG
GTTCTCATAT TAACTTACCT CCTAATTAGG TTTAATACCA TGATAGTGAT GGTATTACTG
TTGGCTCAGT CATTATTCAA CAGTATGTTT GCTGTGGCTA GTTCAACCAT TGTAGGTGAC
CTTGAGAGAG GGTTAGGTAA TTTAATTAGG GCCTACTCCA GGATTAGGGT GGGGGCTAAT
GCTGGTTGGG CCATTGGTCC ATTAATAAGT GGTTACGTAA TGTATTACTT GGGTTATAGT
TACGTGTTCT TAATAACGGC ATTAATTGCA TTAGCCTCAA CACCATTAAT ATTAATGCTA
AGGGGATTAG AGAGGAAGGT GGTATCGTTT AGGTTAATTA GGGTTAATGC CCCATTTGCC
CTATTCCTAA TACCAACAAT ACTCCTCTTC TCAACAATGG GGTTACTTGG CTTCGCCTTA
ACAACATACT ATAACGTTGT GAAGCATATT GAGATCTCAG ACATAGGGCT TGTATACGCT
CTTAACGGAT TCATGGTTGT TGCCCTACAG GATAGGGTCG GTGGCTTCAT ATCAAGGAGG
GATATTAGGT ATTGGCTTGT TTACGGTTCA TTAATCTACT ATATTTCATA TGGTGTAATA
TTCATGGTAA GTAACATCTA TGAGGCATTA CTGGACATGG CACTGGTAAC CACGGGTGAA
ATCATTGTCT CACCCATAGT TCAGGCAATA GCCATGAATA TGGCTGAAAG CGATAAGAGG
GGCCAGTACA TGGGTGTCTT CGGTTTAGCA TCCAACATTG GTAGAACCAT GGGTTCAGTA
ATGTCCAGTG AGGCTATGCA GTATATGATT AATAACCCGA TCCTACTATG GCAATCCTTA
TCATCCCCAG CCCTAGTGGC TTCATTAATC TACCTAAGCC TATTTAAGGT TAACCGTAGG
TTAATCAATA TGACTAGGCA GCTTCATTAA
 
Protein sequence
MSSTINKIAL LNGMRTLGFS LIWPYIGLIL YKLGLPFWLI GIYYSAQMLV NIVSQVIGGV 
LTDIMGRVKL MLIGNIGSSM VLILTYLLIR FNTMIVMVLL LAQSLFNSMF AVASSTIVGD
LERGLGNLIR AYSRIRVGAN AGWAIGPLIS GYVMYYLGYS YVFLITALIA LASTPLILML
RGLERKVVSF RLIRVNAPFA LFLIPTILLF STMGLLGFAL TTYYNVVKHI EISDIGLVYA
LNGFMVVALQ DRVGGFISRR DIRYWLVYGS LIYYISYGVI FMVSNIYEAL LDMALVTTGE
IIVSPIVQAI AMNMAESDKR GQYMGVFGLA SNIGRTMGSV MSSEAMQYMI NNPILLWQSL
SSPALVASLI YLSLFKVNRR LINMTRQLH