Gene Cmaq_1964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1964 
Symbol 
ID5708461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp2040074 
End bp2041303 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content44% 
IMG OID641276473 
ProductGntR family transcriptional regulator 
Protein accessionYP_001541770 
Protein GI159042518 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAGTT GCGTGAGTAT GCATATGAAT GTGAAGGATT TACTCTCAGA GCGGACTAAC 
TACATGAAGG CCAGTGACAT AAGGGAGCTC CTCAAATGGG CCACGGCTGA CGTAATATCC
TTCGGAGGGG GTTTACCGGA TTTATCCCAA TTACCACTTA ATGAGTACTC TGAGGTAGCT
AAATTCGTTG TAAGTAACTA TGGTTTAAAG GCGCTTCAAT ATGGTAAAAC AGAGGGGGTT
GATGAGCTTA AGGAGGAGTT GGCTAAGTTC ATGGCTAAGC AGGGCATTAG GACTGATGCA
TCATTAATAC TACCCACTGT GGGTAGCCAG GAGGCGCTTG AACTCATGGC AAGGGTCTTC
ATTGACCCAG GTGACGTAAT AATTACTGAG AAACCAACCT ACTTTGCTGC ATTACAGGCC
TTTAGGGTTT ATAGGGCTAG GATAATTGGT GTCGATATGG ATAATGATGG ATTAATCATT
GATAAGCTTG AGGACACGAT TAAGAGGCTT AGGAGTGAGG GCGCTAGAGT AAAGTTCATA
TACACCATAC CCATTTGCCA AAACCCCACT GGGGTAATGA TGAGTATTGA TAGGAGGAAG
GCGCTGCTTG AGTTAGCCAG TAGGTATGAT TTAATGATTC TTGAGGATAA TCCATACAGT
TACTTCACCT TTGACCCAGT GGATACAACA CCCCTAAAGG CCCTTGATAA TGAGGATAGG
GTCGTGTACA CGTCAACGTT CAGTAAGATA ATAGCCCCAG GCATTAGATT GGGTTGGGTT
GTGGCTAATC AGGATATAGT GAATTGGTTA GCCATAGCTA AGCAGGCAAT GAATCTACAT
ACCCCAACCC TAAGCCAATA CATAGCCTAC GAATTACTGA GAAGAGGTAT AGTGGATAGG
TATATTTCTA AGATTAAGGA AACCTATAAG GTTAAGAGGG ATGCAATGCT TGACGCATTA
TCCAGGTACA TGCCGAGTGG GGTTTCATGG ACTAAGCCAA GTGGGGGAAT GTTCATATGG
GTTACCGTCC CCGGCAACGT GAGGACTGAG GATATGCTTA ACTTAGCTAT AAGAAAGTAT
AAGGTTGCCT ACGTACCGGG TAAATCATTC TACCCAGATG AGGATGTTCA TAATGATATG
AGACTGAACT TCACCTACCC ATCAGTGGAC CAGATATATG ATGGTGTTAG GAGACTGGCA
CTGGCCATAG AGGAATATAA AGGCAAGTAA
 
Protein sequence
MFSCVSMHMN VKDLLSERTN YMKASDIREL LKWATADVIS FGGGLPDLSQ LPLNEYSEVA 
KFVVSNYGLK ALQYGKTEGV DELKEELAKF MAKQGIRTDA SLILPTVGSQ EALELMARVF
IDPGDVIITE KPTYFAALQA FRVYRARIIG VDMDNDGLII DKLEDTIKRL RSEGARVKFI
YTIPICQNPT GVMMSIDRRK ALLELASRYD LMILEDNPYS YFTFDPVDTT PLKALDNEDR
VVYTSTFSKI IAPGIRLGWV VANQDIVNWL AIAKQAMNLH TPTLSQYIAY ELLRRGIVDR
YISKIKETYK VKRDAMLDAL SRYMPSGVSW TKPSGGMFIW VTVPGNVRTE DMLNLAIRKY
KVAYVPGKSF YPDEDVHNDM RLNFTYPSVD QIYDGVRRLA LAIEEYKGK