Gene Cmaq_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1941 
Symbol 
ID5709676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp2017398 
End bp2018588 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content46% 
IMG OID641276449 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001541748 
Protein GI159042496 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.545101 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAATG AAGTTGTGAT AGTGGGGTAT GTGAGGACCC CCATAGGTAA GTTCGGTGGT 
TCACTTAAGA GTGTTAAATC ACCTCACTTG GCTGCTGAGT CGATAAGGGC ATTATTAAGG
AGGACTAAGG TTGATTCAAG TATGATTGAT GAGGTTATAT TCGGCTCAAC ATTACAGGGT
GGGATGGGGC AGAATATTTC CCGCTACGCA GCATTACTGG CTGGTTTACC GAATTCAGTC
AGTGCCTATA CGGTTAATAG GGTTTGTTCA TCAGGTATGC AGGCAATTAT TGATGCTTAC
AGGGAATTAG TGCTTGGTGA TGCATCACTT ATTATTGCTG GGGGTGTTGA CTCAATGAGT
ACTCAACCAA TAGCATTACC CAGCGAGTAT AGGTGGGGTG TTAAGCACTT CATAGCTAAG
ACTATTCAAC CAATAGACCT AATGGTTTAC GATGGTTTAA TAGATCCAGT AACAATGATG
ATTATGGGGC AGGAGGCTGA CTTAGTGGCT AAGGAGAATG AGTTAACTAG GGATGAGTTA
GATAATTACG CCTACATGAG CCACATGAGG GCTGTTAAGG CCACTGAGGG TAAGTTATTC
AAGGAGATTG AGCCAATAGA CACAACAATA GAGGGTGAGA GGGTTAAGCT TGATCACGAT
GAGGGAATAA GGCCTGATAC AAGCCTAGAG AAGCTTAAGG CCCTTAAACC AGCCTTCACC
CCAAATGGAT TCCACACAGC CGGTAACTCA TCGCAGTTGA GCGACGGGGC TGCGGCATTA
TTATTAACAA CAATGGATAA GGCCAAGGAA ATGGGGTTAA GGCCAGTGGC TAAGATACTT
GGTTACGCAT GGTACATGAT TGAGCCAAGG AGGTTCACCG AGGCGCCGAC GTACGTTATA
GATAAGGTAC TTAGGAAACT CAACCTAAGC ATTAACTCCG TTGACTACTT TGAGGTTAAT
GAAGCCTTCG CAGTGGTTAA CGTACTGGTT AATAAGAGGC TTGGTGTACC GTACGATAAG
ATGAACATAT TCGGTGGCGC AATAGCCATC GGCCACCCCC TAGGCGCCAG TGGGGCTAGG
ATAGTGACTA CCCTGTTAAC CGGCCTTGAG CACACTGGTG GTAGAATCGG TGTTGCTGCC
CTATGCCACG GCACTGGGGG AGCCACTGCA CTAGTTGTTG AGAGACTGTG A
 
Protein sequence
MSNEVVIVGY VRTPIGKFGG SLKSVKSPHL AAESIRALLR RTKVDSSMID EVIFGSTLQG 
GMGQNISRYA ALLAGLPNSV SAYTVNRVCS SGMQAIIDAY RELVLGDASL IIAGGVDSMS
TQPIALPSEY RWGVKHFIAK TIQPIDLMVY DGLIDPVTMM IMGQEADLVA KENELTRDEL
DNYAYMSHMR AVKATEGKLF KEIEPIDTTI EGERVKLDHD EGIRPDTSLE KLKALKPAFT
PNGFHTAGNS SQLSDGAAAL LLTTMDKAKE MGLRPVAKIL GYAWYMIEPR RFTEAPTYVI
DKVLRKLNLS INSVDYFEVN EAFAVVNVLV NKRLGVPYDK MNIFGGAIAI GHPLGASGAR
IVTTLLTGLE HTGGRIGVAA LCHGTGGATA LVVERL