Gene Noc_0999 details


Gene Information

Locus tag: Noc_0999
Symbol:
ID: 3707391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1106497
End bp: 1107486
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 54%
IMG OID: 637737504
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_343037
Protein GI: 77164512
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR03466] hopanoid-associated sugar epimerase
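
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are inclusive (which is consistent with the listed 990 bp) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information entries above.
start_bp = 1106497
end_bp = 1107486

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 990
print(protein_length_aa)  # 329
```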


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.170606
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAGCT TTGTTACCGG TGCAACCGGC TTCGTGGGCT CGGCGGTAGT CAAACAACTA 
CTCGATGCGG GAGAAACCGT CCGGGTATTA GCCCGCTCTA ATAGCAATCG GCGTAATTTG
GAAGGGCTAC CCGTAGAAAT CTTTGAAGGC GATCTCAAGG ATCAACGGCG CCTAGAAAAA
GCGCTCCATG GCTGCCAAGC CCTTTTTCAT GTCGCGGCTG ATTACCGCTT ATGGGCCCCC
CGTTCGCAAG ACTTTTATGA CACTAATGTC CAGGGCTCCC AAAATATCAT GCACGCTGCG
GCGGAAGCCG GCGTTAACCG TATTGTTTAC ACCAGTAGCG TCGCCACGCT GGGATTAAGC
CCTGATGGCA CGCCAGCCCA TGAGGAGACC CCTTCCAGCC TTGAAACCAT GATCGGCCAC
TATAAACGAT CTAAATTCTT GGCTGAGGAG TCCATCAAGG ATCTAGGACA CCGGCTGAAG
CTGAATATCG TGATTGTGAA TCCCTCCACC CCCATTGGAC CGCGGGATAT CAAACCCACG
CCTACCGGCA GAGTCATTGT CATGGCCGCT TCGGGAGGCA TGCCCGCCTA TATGAACACT
GGCCTCAACG TAGTCCATGT GGATGATGTG GCCGCCGGCC ATCTCCTGGC CTTTGAAAAA
GGCCGCATTG GCGAGCGTTA TATCCTTGGA GGAGAAAATC TCACTCTGCG CGATATTCTA
GAAGCAATCG CCCAGCTTAC CCACCGCCGT CCCCCCAGAT TCCGGCTTTC CCCTCATGCG
GTCCTGCCTA TCGCCCATCT TGCCCAAGGC TGGGCCCGCC TGACCAGCGG CAAAGAACCC
ATGACCACTG TCGATGGCGT GCGGATGGCA AAGAAATATA TGTTTTTTTC TAGCACTAAG
GCGGAACAAC TGCTAGGTTA TCAATCCCGC CCCGCCCAGG AAGCGATTCA TGATGCGATT
AGCTGGTTTC AGGCTAATGA CTATCTGTAG
 
Protein sequence
MTSFVTGATG FVGSAVVKQL LDAGETVRVL ARSNSNRRNL EGLPVEIFEG DLKDQRRLEK 
ALHGCQALFH VAADYRLWAP RSQDFYDTNV QGSQNIMHAA AEAGVNRIVY TSSVATLGLS
PDGTPAHEET PSSLETMIGH YKRSKFLAEE SIKDLGHRLK LNIVIVNPST PIGPRDIKPT
PTGRVIVMAA SGGMPAYMNT GLNVVHVDDV AAGHLLAFEK GRIGERYILG GENLTLRDIL
EAIAQLTHRR PPRFRLSPHA VLPIAHLAQG WARLTSGKEP MTTVDGVRMA KKYMFFSSTK
AEQLLGYQSR PAQEAIHDAI SWFQANDYL
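
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: the function names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for non-initiator codons. Whitespace is stripped first because the sequences are displayed in blocks of ten characters.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: translation table 11 uses the same amino-acid assignments as
# the standard code; it differs only in the start codons it permits.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence and the last twelve bases.
print(translate("ATGACCAGCT TTGTTACCGG TGCAACCGGC"))  # MTSFVTGATG
print(translate("GACTATCTGTAG"))                      # DYL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence shown above, and `gc_fraction` can be compared against the listed GC content in the same way. Note that `gc_fraction` raises `ZeroDivisionError` on an empty string.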