Gene Noc_1959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1959 
Symbol 
ID3704973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2245504 
End bp2246460 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content52% 
IMG OID637738435 
Productesterase/lipase/thioesterase family protein 
Protein accessionYP_343951 
Protein GI77165426 
COG category[I] Lipid transport and metabolism 
COG ID[COG2267] Lysophospholipase 
TIGRFAM ID[TIGR03100] hydrolase, ortholog 1, exosortase system type 1 associated 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.175259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAACT ATGTTGGGAA TGGGTATAGG GTAGAGGGGT TTCCAGGCGC ACTCAAAGCC 
AGGGGAGACA TAACGCATAT TTATAACGGA CAGTCGGTGT CCAGGGTGAC GGCTGAAGAA
GCACTGACCT TCCATTGCCT TGGAGAGCTT CTCGTTGGCA TCTTGCACCG AGGTTCAGAG
TACGCCACCC GGGGTGTCTT GGTCGTTGTT GGTGGCCCCC AATACCGGGT AGGCAGTCAT
CGCCAGTTTG TATTGTTTGC CCGCTGGTTG GCGGAAGCGG GAGTTCCTGT ATTCCGTTTC
GATTACCGTG GAATGGGAGA CAGCGGTGGC GGTACTCGTA CCTTTGAGAA TATTGAAGTT
GATATCCGTG CAGCGATCGA TGCTTTTCTA GAAGCTGCGC CAGGGTTAAG AGAAATCGTG
ATTTGGGGCC TTTGCGATGC CGCCTCGGCG GCCTGTTTTT ATGCGCCCTC AGATCCGCGA
GTAGCAGGTT TGGTGTTGTT GAACCCCTGG GTGCGAACGG AGGAAGGGCA GGCAGCCGTT
TACCTCAAGC ATTATTATTT TAGAAGGTTA GTTAGCGGCG ACTTTTGGCG CAAGTTTTGG
CGCCGGGAAT TTGATTATAA GGATTCACTG CGTTCATTGG GAGATATATT AAGGAAAGCT
AATTCCTGGC GGCAGAAGGT TGATGAAGTT GAGACTGAAG AAATATTGCC GTTGCCCAAG
CGGGTATATA AAGCTTTAGA GCAATTCCAG GGCAGGACGC TCTTGATACT AAGCGGCAAG
GATCTGACAG CGAATGAGTT TCGCGATACC ATCTCGTCTT CATCCGCTTG GCGCGGCTTG
CTTCGCAGTA GAAGCATTGA GCGTCGCGAG TTGTCGACTG CGGACCATAC CTTCTCCCGC
CGCGTTTGGC GGGATCAGGT GGCTCAGTGG ACCCTTGAAT GGGTGCGGTC ATGGTAA
 
Protein sequence
MMNYVGNGYR VEGFPGALKA RGDITHIYNG QSVSRVTAEE ALTFHCLGEL LVGILHRGSE 
YATRGVLVVV GGPQYRVGSH RQFVLFARWL AEAGVPVFRF DYRGMGDSGG GTRTFENIEV
DIRAAIDAFL EAAPGLREIV IWGLCDAASA ACFYAPSDPR VAGLVLLNPW VRTEEGQAAV
YLKHYYFRRL VSGDFWRKFW RREFDYKDSL RSLGDILRKA NSWRQKVDEV ETEEILPLPK
RVYKALEQFQ GRTLLILSGK DLTANEFRDT ISSSSAWRGL LRSRSIERRE LSTADHTFSR
RVWRDQVAQW TLEWVRSW