Gene Jann_4050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_4050 
Symbol 
ID3936538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp4154340 
End bp4155503 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content66% 
IMG OID637906435 
ProductAcetyl-CoA C-acetyltransferase 
Protein accessionYP_511992 
Protein GI89056541 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAAG TCGTTATCAC GGGTGCGGCG CGCACCCCCA TGGGCGGGTT TCAGGGCGCC 
TTGTCCCCCG CCACGGCGTC CGAGCTTGGC GGTGCGGCGA TCAAGGCAGC CATGGGCAAT
GCCACGGTCG ATGAACTGTT GATGGGCTGC GTCCTGCCCG CAGGCCAGGG TCAGGCCCCG
GCGCGGCAGG CGGGCTTCCA TGCGGGCCTG GGCGACAGCG TGCCTGCGAC AACGCTCAAC
AAGATGTGCG GCTCCGGTAT GAAGGCCGCG ATGATGGCCT ATGACACGCT TGCCCTCGGT
CAGTCCGATG TGATCGTCGC TGGCGGCATG GAATCGATGA CCAATGCGCC CTACTTGTTG
CCCGCCATGC GGGGGGGCGC GCGGATCGGG CACCAGAAAA CGCTGGACCA CATGTTCCTC
GATGGCCTTG AAGATGCCTA CGACAAGGGA CGTCTGATGG GCACCTTCGC GGAAGATTGC
GCCGAGGCGT TCCAGTTCAC CCGGGACACC CAGGACGCCT ATGCGCTGGG GTCGCTGGAA
AACGCCCTGG CGGCGATCAA GAGCGGCGCG TTCGACGAAG AGGTCACGTC TGTCACGATC
ACCACCCGCA AAGGCAGCGC TGACGTCGCA ACGGACGAAC AACCCGGCAA TGCCCGCCCC
GACAAGATCC CTCAGCTCAA GCCTGCGTTT CGCGAGGGTG GCACGGTGAC GGCGGCGAAT
TCCTCCTCCA TCTCCGACGG CGCGGCAGCC TTGACGCTTG CACGCAAAAG CGCGGCAGAG
GCGCAAGGCC TCCCCATCCG CGTCCGCATC CTCGGCCACG CGTCCCATGC CCATGCGCCC
GCACTGTTCC CCACGGCCCC CGTGCCCGCC GCCCGTAAGC TGTTGGACCG TATCGGCTGG
TCCATTGATG ACGTCGACCT TTGGGAGGTG AATGAGGCCT TCGCCGTGGT TCCCATGGCC
TTCATGCACG AGATGAACAT CCCGCGGGAA AAGATGAACG TGAACGGCGG GGCCTGTGCC
CTCGGCCACC CGATTGGTGC ATCCGGCGCG CGGATCATCG TCACCCTTCT CCACGCCCTT
GAAGCCCGCA ACCTCAAGCG TGGCATCGCG GCGATCTGCA TCGGCGGCGG TGAGGGCACG
GCGATCGCGA TTGAACGGCC ATGA
 
Protein sequence
MEEVVITGAA RTPMGGFQGA LSPATASELG GAAIKAAMGN ATVDELLMGC VLPAGQGQAP 
ARQAGFHAGL GDSVPATTLN KMCGSGMKAA MMAYDTLALG QSDVIVAGGM ESMTNAPYLL
PAMRGGARIG HQKTLDHMFL DGLEDAYDKG RLMGTFAEDC AEAFQFTRDT QDAYALGSLE
NALAAIKSGA FDEEVTSVTI TTRKGSADVA TDEQPGNARP DKIPQLKPAF REGGTVTAAN
SSSISDGAAA LTLARKSAAE AQGLPIRVRI LGHASHAHAP ALFPTAPVPA ARKLLDRIGW
SIDDVDLWEV NEAFAVVPMA FMHEMNIPRE KMNVNGGACA LGHPIGASGA RIIVTLLHAL
EARNLKRGIA AICIGGGEGT AIAIERP