Gene Cmaq_1049 details


Gene Information

Locus tag: Cmaq_1049
Symbol:
ID: 5710089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1099685
End bp: 1100635
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 42%
IMG OID: 641275549
Product: protein of unknown function RIO1
Protein accession: YP_001540868
Protein GI: 159041616
COG category: [T] Signal transduction mechanisms
COG ID: [COG0478] RIO-like serine/threonine protein kinase fused to N-terminal HTH domain
TIGRFAM ID:
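
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1099685, 1100635)
print(length, protein_length_aa(length))  # prints: 951 316
```

Under these assumptions the computed values agree with the Gene Length and Protein Length reported in the record.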


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.290796
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000362034
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCATAA GTAACGTAAT AGCCTCCTAC AATGAGTTAA GTAAACTCGA CTTAAGGGTG 
CTTAGGGTAA TTGAGGTCCT CCACAGGAAT CACGAGTACG TTCCGGTTAA GAGGATTGTG
AATTACATGG GTTTAAGTGA GGAGGTTATT GATAAGTCTA TTTCAAAGAT GAATAAGCTT
AAACTACTGG TTAGGAGGGG GCCTGATAAC GTTAGGTTAA CATTCCCAGC CTACGACATA
CTGTCAATAC ACACCATGGT TAAGAAGGGT GTTATAGATG CCATAGCCCC AACACCCCTT
GGTGTTGGTA AGGAATCAGA CGTATACGCT GCTGATGCGC CAAATGGGGA AAAATACGCC
TTAAAGTTCC ATAGGATTGG TAGAGTTAGT TTCAGGAATA CTAGGAAGTA TAGGGTTTGG
ATTGGGGAGA GGAGGCATGT TACTTGGCTT TACGAAGCTA AGATATCAGC ACACATGGAG
TACCTAGCGT TAACCGAAGC CTATAAGGCT AAGGTACCTG CACCAAGGCC TAGGGCTGTG
ACAAGGCACT TGGTGGCCAT GGAGTACGTT AATGGTGTTG AATTATTTAG GGTTAAGTTA
AGTAATCCTG AGGATGTTCT TGAACAGATT ATTTCAGCCA TTGAGGATCT CCTGAGGATA
AACATTATTC ACGGTGACTT AAACGAATAC AATATCCTAG TTAATCCAAG TGATGAGAAA
ATAACAATAA TAGATTGGCC CCAGTGGATG TACGCTAACG TTAAGGGATC TAGGGTAATC
CTAATGAGGG ACCTCAACAT TATACTGAGG CACTTTAAGT CAAACTACGG GTTAAACGTA
GGCATTGATG CAGTTATGAG TAGGCTAGCC CCATTAATAC CGAACAGTGA ATTACCACCT
GAGAAGGCGT ACTCCAGGTT AATTAAGAGA GTAACATCCT TAGTTAAATG A
 
Protein sequence
MSISNVIASY NELSKLDLRV LRVIEVLHRN HEYVPVKRIV NYMGLSEEVI DKSISKMNKL 
KLLVRRGPDN VRLTFPAYDI LSIHTMVKKG VIDAIAPTPL GVGKESDVYA ADAPNGEKYA
LKFHRIGRVS FRNTRKYRVW IGERRHVTWL YEAKISAHME YLALTEAYKA KVPAPRPRAV
TRHLVAMEYV NGVELFRVKL SNPEDVLEQI ISAIEDLLRI NIIHGDLNEY NILVNPSDEK
ITIIDWPQWM YANVKGSRVI LMRDLNIILR HFKSNYGLNV GIDAVMSRLA PLIPNSELPP
EKAYSRLIKR VTSLVK
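
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC percentage. The sketch below is one way to do this in plain Python. It assumes the space-separated 10-base blocks are only a display convention, and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; the alternative start codons of table 11 are not handled, which does not matter here because the gene starts with ATG. The names `translate`, `gc_percent` and `clean` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, as printed in the record.
print(translate("ATGAGCATAA GTAACGTAAT AGCCTCCTAC"))  # prints: MSISNVIASY
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing the block spacing) is a quick consistency check on the record; `gc_percent` on the same input can be compared with the GC content field.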