Gene Noc_3012 details

Gene Information

Locus tag: Noc_3012
Symbol:
ID: 3705720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3405155
End bp: 3406261
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 54%
IMG OID: 637739486
Product: SMF protein
Protein accession: YP_344984
Protein GI: 77166459
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
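
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3405155
end_bp = 3406261
reported_gene_length = 1107    # bp
reported_protein_length = 368  # aa


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end - start + 1


def protein_length_from_cds(cds_length: int) -> int:
    """Residue count for a complete CDS: one residue per codon, minus the stop codon (assumed included)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = protein_length_from_cds(gene_length)

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions the 1107 bp span corresponds to 369 codons, that is, 368 residues plus one stop codon.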


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGATGAAC GAGCCTACTG GCTCGCCTTG CACCGCGCTC CTGGTGTCGG CAGTGTCAGT 
TTTTGCCGCC TCTTGGAGAA ATATGGCTCG CCGACTGCCT TATTTACTTC CCCGGAAAGG
CTGGCGGGCC TTAGCGATGG AATCCAGCAT TATTTGCGGC AACCTGATTG GAAAGCGGTA
GAGCAGGATC TAAAATGGCT GGAGCAGCCA GACCATTATT TGCTCACTTT GGCCGATCCA
GGATATCCGC CGCTACTGCG GGAGATTCCC GACCCGCCTC CCATCTTATT CGTCCATGGT
GATCCGTCCT TGCTCTCTTT ACCCCAACTA GCAATTGTGG GCAGCCGCAA TCCTTCCCCT
GCGGGGGCTG AAACTGCTGC GCAGTTTGCT ACCTATCTGG CCAATTCAGG TTTAGTTATT
AGTAGCGGGC TGGCGCTTGG TATTGATGCT GCCGCCCATG AGGGAGCGCT AGCTGCAAAA
GCGGCGACGA TAGCTGTGGC GGGAACCGGG CTAGACAGAG TTTATCCGGC CCGTCATCAT
GCCTTGGCTC ATGCCATTGC CGAGAGCGGG GCATTAGTAT CAGAGTTCCC CATTGGAACT
CCTCCGTTAC CTCAGAATTT CCCACGCCGT AACCGGCTTA TCAGCGGCCT TAGCTGGGGT
ATTCTTGTGG TTGAAGCCGC TTTACAAAGT GGCTCTCTCA TTACAGCCCG CCTAGGCGCG
GAACAGGGGC GGGAGATATT TGCTATCCCT GGTTCTATCC ATAACCCCCT CGCCCGGGGC
TGTCATCATC TTATCCGAGA GGGTGCCAAG CTAGTGGAAG CCGCCCAAGA TATTTGGGAG
GAATTGGGAT CTTTGGCAGG CGCAATACCA AACCTCCAAT GCCAGGAAGC GCCCCAAAAA
ATAGAGGCAT CAACCGATGA TCTGGAATAT CAACTTCTAC TGGATTGCTT AGGTTATGAT
CCTCTTCCCA TAGATCTCTT AGTTGAGCGT TGTGGATTGA CGGCAGAAGC GGTTTCCTCC
ATGCTTTTAA TATTAGAGTT ACAAGGCCGC ATCACGGCAT TGCCTGGAGG ACGCTACCTC
CGATGCGGTA AAGAGGGCCA ATCATGA
 
Protein sequence
MDERAYWLAL HRAPGVGSVS FCRLLEKYGS PTALFTSPER LAGLSDGIQH YLRQPDWKAV 
EQDLKWLEQP DHYLLTLADP GYPPLLREIP DPPPILFVHG DPSLLSLPQL AIVGSRNPSP
AGAETAAQFA TYLANSGLVI SSGLALGIDA AAHEGALAAK AATIAVAGTG LDRVYPARHH
ALAHAIAESG ALVSEFPIGT PPLPQNFPRR NRLISGLSWG ILVVEAALQS GSLITARLGA
EQGREIFAIP GSIHNPLARG CHHLIREGAK LVEAAQDIWE ELGSLAGAIP NLQCQEAPQK
IEASTDDLEY QLLLDCLGYD PLPIDLLVER CGLTAEAVSS MLLILELQGR ITALPGGRYL
RCGKEGQS
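
The gene and protein sequences above can be checked against each other by translating the DNA codon by codon. The following is a minimal sketch, not the method used to produce this record. It assumes the standard codon assignments, which NCBI translation table 11 shares for every codon after the start position; table 11 differs only in allowing alternative start codons, which does not matter here because the gene begins with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order for each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 27 bases of the gene sequence listed in this record.
first_codons = "ATGGATGAAC GAGCCTACTG GCTCGCCTTG"
last_codons = "CGATGCGGTA AAGAGGGCCA ATCATGA"

print(translate(first_codons))
print(translate(last_codons))
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow comparison with the listed protein sequence and the reported GC content.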