Gene Jann_3803 details

Gene Information

Locus tag: Jann_3803
Symbol:
ID: 3936283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3888386
End bp: 3889336
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 61%
IMG OID: 637906181
Product: N-acetylmuramic acid-6-phosphate etherase
Protein accession: YP_511745
Protein GI: 89056294
COG category: [R] General function prediction only
COG ID: [COG2103] Predicted sugar phosphate isomerase
TIGRFAM ID: [TIGR00274] N-acetylmuramic acid 6-phosphate etherase
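
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3888386
end_bp = 3889336
gene_length_bp = 951
protein_length_aa = 316


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 951
print(expected_protein_length(gene_length_bp))  # 316
```

Under these assumptions both derived values agree with the listed Gene Length and Protein Length.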


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0010924
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000127342
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGGAGTCG CTGCCGTGTT TACCGCCGTC CAGTCAGATC TTGGGGCACT TGTGTCCGAG 
GCGAGCAATT CGCGCTCTGC GGACATTGAT CTGATGACCA CAGCCCAGAT CCTGGCCTGC
ATGAACGCCG AGGATCGTAA AATCGCCGAT GCCGTCGCAG CAGAGCTTCC CGCGATTGCC
CAGACTGTTG ACAGGATCGT CGCAGCGATT GGCCGTGGCG GGCGCCTTAT CTACATCGGT
GCGGGCACCA GCGGTCGTTT GGGCGTATTG GATGCATCTG AATGCCCGCC CACGTTTTCC
GTCCCTCCCG GCATGGTGGT TGGCCTGATC GCCGGTGGCG ACACAGCGCT GCGCACCTCG
GTTGAGGCGG CCGAAGATGA TGAGGCAACG GGTGCGGAGG ACGTGAAAGC CATCGGGCTG
ACAACCAAAG ATGTCGTCAT CGGTATCGCG GTCAGTGGCA GAACCCCCTT CGTGATGGGC
GCGATAGACT ACGCCCGCCG CATTGGCGCG TTCACTGCCG CGCTGACCTG CAACCCAGGC
TCGCCCATGG CGGACCTTGC TGACATCGCG ATCTCACCCG TTGTCGGGCC GGAGGTTGTG
ACCGGCTCCA CGCGCCTCAA ATCCGGGACC GCGCAAAAAA TGATCCTGAA CATGCTGAGC
ACCGCCAGCA TGATCCGCCT TGGTAAGACA TGGGGCAACC GGATGGTGGA TGTGACGATT
TCAAATCGGA AATTGGCGGA CCGCGCCACT GCCATGTTGC GGGATGCCAC CGGGTGCAGC
GCCGATGATG CGCGTACTTT GCTGGACCAA AGCAATGGCA GCGTGAAACT TGCCATCCTG
ATGCAGATTA CGGGCTGTGA CGCAGATGCG GCCCGCGCAA ATCTGGAGGC TGAAAACGGC
TTCCTGCGCA AAGCCATTGA ACGAGCGGAG AAAACTCCGC CGCAAAGCTA G
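
A GC percentage like the one listed under Gene Information can be recomputed from the gene sequence. The sketch below is one way to do it, assuming the sequence is pasted as plain text with the 10-base grouping shown above; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used as a sample.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace (spaces between 10-base blocks, newlines) and upper-case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a small sample.
sample = clean_sequence(
    "ATGGGAGTCG CTGCCGTGTT TACCGCCGTC CAGTCAGATC TTGGGGCACT TGTGTCCGAG"
)
print(len(sample), round(gc_content(sample), 1))
```

Passing the full 951 bp sequence through the same two functions gives the value to compare against the listed GC content.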
 
Protein sequence
MGVAAVFTAV QSDLGALVSE ASNSRSADID LMTTAQILAC MNAEDRKIAD AVAAELPAIA 
QTVDRIVAAI GRGGRLIYIG AGTSGRLGVL DASECPPTFS VPPGMVVGLI AGGDTALRTS
VEAAEDDEAT GAEDVKAIGL TTKDVVIGIA VSGRTPFVMG AIDYARRIGA FTAALTCNPG
SPMADLADIA ISPVVGPEVV TGSTRLKSGT AQKMILNMLS TASMIRLGKT WGNRMVDVTI
SNRKLADRAT AMLRDATGCS ADDARTLLDQ SNGSVKLAIL MQITGCDADA ARANLEAENG
FLRKAIERAE KTPPQS
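
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch, not the database's own method: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation, and it does not model the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
first_30 = "ATGGGAGTCG CTGCCGTGTT TACCGCCGTC"
last_line = "TTCCTGCGCA AAGCCATTGA ACGAGCGGAG AAAACTCCGC CGCAAAGCTA G"
print(translate(first_30))   # MGVAAVFTAV
print(translate(last_line))  # FLRKAIERAEKTPPQS
```

Both fragments match the start and end of the protein sequence listed above; the final line is in frame because the 900 bases preceding it are a multiple of three.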