Gene Cmaq_1390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1390 
Symbol 
ID5709424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1467048 
End bp1468328 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content44% 
IMG OID641275901 
ProductFmu (Sun) domain-containing protein 
Protein accessionYP_001541206 
Protein GI159041954 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.122511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTGAGT TTAACCTAGG GGCAGACTTC GTGGACTTCG CCGCTGACGT AATGTACATT 
ATTGAGGATA GGTTAGTGTC AATGGATAAG GCATTCCACT ACGCCAGGCT TAGGCATAGG
TTGAAGGCAC CGCTTAGGGT TTACTATAAT GCGGTTAGTG ATGTTGTTAG GAATTACGCC
TACTTATCCT TCATGGCTAA GCAACTGTTG GGTTCAAGCT CAAGGAAGGC TATAGCCAAG
ACCTGGCTAC TCCTAAACAC TAATGACCAC TACTCCAGGA GGCTTAGGAA GAGGGTGAGG
GGTAGGGTTG AGGGCGCTGA GGCTAGGTTA AGTGAGGTTA AGGATAATGA CCCGTTAACG
TACCTATCCA TTAAGTACTC CTTCCCAAGA TTCATAGTGG AGGAGTTGAG TAGGGGAATG
GGGCTTAGTG AACTTGAGGA TTACTTATCA TCACTCAACA GGAGGGTTAC TTGGCTTAGG
GTTAATACGC TTAAGGTGGA TTTAGATAAG GCCATTAGGC TCCTTGAGGA TGAGGGGGTT
GAATTCACTC AAAGTAGACT ATACCCATTC ATGCTACTGG TTAAGGGTTA TAGGAGGCCA
ATGGGTTACT TAAGGCTATT TAAGGATGGG GCTGTGGTTC CCCAGGACTT GGCGTCGGCA
TTAGTGGTAC TCAACCTAAT GCCTGAACCC GGGGACGTGA TTATTGATGC CTGCGCCGCC
CCAGGTATGA AGACTAGTCT AATAATGCAG TTAACTGATA ATAAGGCTGA GGTCATTGCT
ATTGATGTTT CTAAGAATAG GTTGAGTAAA ATGAGGTCAA TATTAAGGAG AATGGGTGTT
GATGACTCAA GGGTGCATAT AATGCGTTCA GACTCAAGTA GATTAAGGTT AACTGGGGTT
AATGTTAATA AGGTGCTTAT TGATGCACCA TGCACCTCAA GCGGGGCAGT CTCAAAGGAT
CCGGGAATTA AACTAATACT AGCCAGTAAT CCAGGCTTGG TTAAGCGTCA ATCACTGGTG
CAGTCATCAA TACTACTTAA CTTAATTAAC CAGCTTAAGG ATGCATCAAT AGTATATGCT
ACCTGCTCAA TACTACCTGA GGAGGGTGAG GAGGTTATTG AGAGAATTAA CTCATCAAGT
AGTGTTAGTT TAGTTAAGCC CAGTGTGGGT GATTTAAGTA ACGGTTACGT GAATTACCCT
GTATCAAGCG TTGTGGGTAG GGTAATGCCC CATATTCATA ATGCTGAAGG CTTCTTCATA
TCAAAGCTCA CCATTAACTA G
 
Protein sequence
MVEFNLGADF VDFAADVMYI IEDRLVSMDK AFHYARLRHR LKAPLRVYYN AVSDVVRNYA 
YLSFMAKQLL GSSSRKAIAK TWLLLNTNDH YSRRLRKRVR GRVEGAEARL SEVKDNDPLT
YLSIKYSFPR FIVEELSRGM GLSELEDYLS SLNRRVTWLR VNTLKVDLDK AIRLLEDEGV
EFTQSRLYPF MLLVKGYRRP MGYLRLFKDG AVVPQDLASA LVVLNLMPEP GDVIIDACAA
PGMKTSLIMQ LTDNKAEVIA IDVSKNRLSK MRSILRRMGV DDSRVHIMRS DSSRLRLTGV
NVNKVLIDAP CTSSGAVSKD PGIKLILASN PGLVKRQSLV QSSILLNLIN QLKDASIVYA
TCSILPEEGE EVIERINSSS SVSLVKPSVG DLSNGYVNYP VSSVVGRVMP HIHNAEGFFI
SKLTIN