Gene Cmaq_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0449 
Symbol 
ID5709777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp482871 
End bp486266 
Gene Length3396 bp 
Protein Length1131 aa 
Translation table11 
GC content47% 
IMG OID641274952 
ProductDNA-directed RNA polymerase subunit B 
Protein accessionYP_001540284 
Protein GI159041032 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR03670] DNA-directed RNA polymerase subunit B 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000490732 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTAGTG TTAGGAGGGA TTCTCAAACG TCATTACTGC CATTGCCGAT ACTCCGAGCG 
GAGGATTCAG GAGGATACAT AGCCAGTGAG GATAGGTGGA CTCTTGTTGA ATCATACGTG
AGGAGTTATG GGTTGGTTAG GCATCAGATA GATTCATTTA ATGATTTCGT GGATAGGAAG
CTTAAGGAGA TAGTTCAGGA ATTTAACATA GATCTGGGTG ACGTTAAGGT TAAGTTCATT
GATGTTGAGG TTGGGAAACC GAGGTTTAAG GAACCCACTG GGGTGGAGAA TGTAATATAC
CCAATGGAGG CTAGGTTAAG GAACATAACC TACGCAGCAC CCATGCGGCT TAAGGTCATG
CTCTACATAA ATGGTGAAGA ATACACTGAG ACTGTTCCCC TAGGTGACTT ACCTATTATG
GTTAAATCAA AGTACTGTAA CCTATACGGC TTAAAGCCCC AGGCCATAGT TAAGAAGCTT
GAGGACCCAA ATGACCCAGG TGGCTACTTC ATAATAAACG GTAGCGAGAG GGTTGTGGTT
TCCCAGGAGG ACCTGGCTCT TAATAAGCCA ATATATGATT ACGATGAGAG GGGGGCCACA
AGGATACCTA GGGCTAAGAT AATATCCATG GGTCCAGGCT ACAGGACTAC GGTGGTTGTT
GAGTACCATA AGGATGGTGT AATTTACGTC CAGATACCCA AGATACCCAC TAGAATGCCC
TTCCCAGTGG TCATGAGGGC CCTCGGCCTT GAGAAGGATC AGGACATTGC TCTAGCTGTG
AGTGATAATG ATGAGATTCA AAGGGAGTTG CTTGCATCAT TCGAGATGGC TATTCAAATA
GCGCCAACCG TTGATGAGGC TAGGGACTAC ATAGGTAGGA GGATTGTGCT TGGTCATCCC
AGGGAGATAA GGATTCAGAG GGCCCTTGAA TACCTTGATA AGTACTTCCT ACCTCACCTG
GGCACTACGC CTGATGATAA GGTTAGGTTA AGCAAGGCCA TTGAGCTTGG TCAAGCAGTG
GCTGGTGTTA TTGAGCTTTA CAAGGGTTGG AGGCAGCCTG ATGATAAGGA TCACATGTCT
AATAAGAGGG TTAGGCTTGT CGGTGACTTA CTGGCCCAAT TATTTAGGTC AATATTCGCC
CAGTTTGTTC AGGATCTTAG GAATCAATTG GAGAAGCAGT ACTCAAGGGG TAAGATACCT
GAATTAAGGA CAATAGTTAG GGCTGATATA ATTAGTGATA GGCTTAAGCA CGCCTTATCC
ACAGGTAATT GGGTTGGTGG AAAGACTGGG GTCACGCAGA TGCTTGATAG GACTAACTAT
GTTTCAACAA TAAGCCACCT AAGGAGGGTT GTTTCATCAT TAAGCAGGAC TCAGCCTCAC
TTTGAGGCAA GGGACCTTCA CCCAACTCAA TGGGGTAGGT TATGCGCCAT AGAGACCCCT
GAGGGGCAGA ACTGTGGTTT AGTTAAGAAC ATGGCCCTTA TGGCCACTGT AACAGTGGGT
GTTGATGAAG CCAGCGTGGA AGCTATGTTG AGGGAAATGG GTGTTATTAA TGTTCTTGAC
GCCAGAAGGA ATGAGATTAA GGGGGCTAAT GTACTGCTTA ACGGTAGGTT AATAGGTATT
CATAGGGATC CACAAGCCCT CGTAAACGCC ATTAGGGAGG CCAGGAGGAG GGGTGACATT
AATGGTGAGG TTAACGTTGG TTACATTGAG AAGCTTAATG AGGTTAGGGT TAACTGTGAC
GGTGGTAGGT TAAGGAGGCC ACTGTTGATT ATAAGTAATG GTAAGTTAAG GTTAACTAAG
GAGCATATCG AGAAACTGAG GAGTGGGGAG TGGACTTGGG ACGACTTGGT TAAGAATGGT
ATAGTGGAGT ACCTTGATGC TGATGAGGAG GAGAACGCCA TGGTAACCAT AGGTGATACT
AAGGATGTTG ACTTAAGCAA GTACACGCAC ATGGAGATTA TACCAAGCAT AATGCTGGGC
GCTGTGGCGC ATATAATACC TTACTCTGAG CATAATCAAT CACCCAGGAA CATTTATGAG
GCAGCCATGG CTAAGCAATC CCTAGGATTC CCCTACTCTA ACTACAGGTA TAGGATTGAC
AGTAGGGGTC ATTTACTACT CTACCCTGAA AGACCCCTGG TGACCACAAG GGGTCTTGAG
CTAATAGGCT ACAGTATGAG GCCATCAGGT CAAAATGCAG TGCTTGCGTT AGTGTCATTC
ATGGGTTACA GTATAGAGGA TGCAGTCATG ATTAATAAGG CAGCCATAGA GAGGGGTATG
TTTAGAAGCA TCTTCTACAG GTCATATGAA ACCGAAGCCA TGAGGTACCC CATGGGTGAG
AACGATAGGA TAACCATACC ACCACCCACA GTTAGGGATT ATAGGGGTGC TGAGGCCTAC
GCCCACCTGG ATGAGGACGG CATAGTGCCT CCAGAGGTCT TTGTATCAAG TAAGGAGGTT
TTAATAGGTA AGATAAGCCC ACCAAGGTTC TACAGTGCAT TGGCTGAGAG TCAACTTGCC
GGTGAGTGGA AGGATAATTC AATAACAGTG AGGAGGGGTG AGAAGGGTAT TGTTGACCAA
GTCCTAATAA CTGAAAGCAG TGAGGGCTTT AAGTTAATTA AGGTTAAGGT TAGGGAACTT
AGGATACCTG AATTAGGTGA CAAGTTCGCT TCAAGGCATG GGCAGAAGGG TGTAGTGGGT
ATGATAGTGC CCATGGAGGA TATGCCCTTC ACTGAGAACG GTATAACACC GGATGTGGTT
ATTAATCCGC ATGCACTACC AAGCAGAATG ACCATTGGGC AATTGCTTGA GGCCATAGCA
GGTAAGACTG CTGCACTATA CGGTACACTG GTTGATGCAA CACCCTTTGA GGGTGTTCCT
GAAGGCACTA TTAGAAGCAT GTTAATGAAG GCTGGGTACA GGTGGAGTGG TAGGGAGACG
ATGTACAGTG GGTTAATGGG TACTAGGCTT GATGCAGATA TATTCATTGG AGTAGTGTAT
TACCAGAAGC TACACCACAT GGTTGCTGAT AAGATACATG CCAGGGCAAC TGGACCAATG
CAGATACTGA CCAGGCAACC AACAGAGGGT AGATCCAGGG AGGGTGGGTT GAGGCTTGGT
GAAATGGAGA GGGATGTCTT AATAGCCCAT GGAGCCGCAG CGCTGCTTCA CGAGAGGATG
GTTGAGTCAA GTGATAAGTA CGTGATGTAC GTGTGCGAGG ACTGTGGTAT GATGGCTTGG
TGGGATGACT TCAAGAAGAG GCCAATATGC CCAATACACG GTGATAAGGG GAGGATAGCC
AAGGTCATTG TCCCCTACGC CTTCAAGCTA CTGCTACAGG AGTTAATAAG CCTAGGCATA
TACCCGAAGC TTAAGCTCTC TGAACCCATA GGATGA
 
Protein sequence
MSSVRRDSQT SLLPLPILRA EDSGGYIASE DRWTLVESYV RSYGLVRHQI DSFNDFVDRK 
LKEIVQEFNI DLGDVKVKFI DVEVGKPRFK EPTGVENVIY PMEARLRNIT YAAPMRLKVM
LYINGEEYTE TVPLGDLPIM VKSKYCNLYG LKPQAIVKKL EDPNDPGGYF IINGSERVVV
SQEDLALNKP IYDYDERGAT RIPRAKIISM GPGYRTTVVV EYHKDGVIYV QIPKIPTRMP
FPVVMRALGL EKDQDIALAV SDNDEIQREL LASFEMAIQI APTVDEARDY IGRRIVLGHP
REIRIQRALE YLDKYFLPHL GTTPDDKVRL SKAIELGQAV AGVIELYKGW RQPDDKDHMS
NKRVRLVGDL LAQLFRSIFA QFVQDLRNQL EKQYSRGKIP ELRTIVRADI ISDRLKHALS
TGNWVGGKTG VTQMLDRTNY VSTISHLRRV VSSLSRTQPH FEARDLHPTQ WGRLCAIETP
EGQNCGLVKN MALMATVTVG VDEASVEAML REMGVINVLD ARRNEIKGAN VLLNGRLIGI
HRDPQALVNA IREARRRGDI NGEVNVGYIE KLNEVRVNCD GGRLRRPLLI ISNGKLRLTK
EHIEKLRSGE WTWDDLVKNG IVEYLDADEE ENAMVTIGDT KDVDLSKYTH MEIIPSIMLG
AVAHIIPYSE HNQSPRNIYE AAMAKQSLGF PYSNYRYRID SRGHLLLYPE RPLVTTRGLE
LIGYSMRPSG QNAVLALVSF MGYSIEDAVM INKAAIERGM FRSIFYRSYE TEAMRYPMGE
NDRITIPPPT VRDYRGAEAY AHLDEDGIVP PEVFVSSKEV LIGKISPPRF YSALAESQLA
GEWKDNSITV RRGEKGIVDQ VLITESSEGF KLIKVKVREL RIPELGDKFA SRHGQKGVVG
MIVPMEDMPF TENGITPDVV INPHALPSRM TIGQLLEAIA GKTAALYGTL VDATPFEGVP
EGTIRSMLMK AGYRWSGRET MYSGLMGTRL DADIFIGVVY YQKLHHMVAD KIHARATGPM
QILTRQPTEG RSREGGLRLG EMERDVLIAH GAAALLHERM VESSDKYVMY VCEDCGMMAW
WDDFKKRPIC PIHGDKGRIA KVIVPYAFKL LLQELISLGI YPKLKLSEPI G