Gene Noc_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2009 
Symbol 
ID3705199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2317596 
End bp2318786 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content51% 
IMG OID637738486 
Producthypothetical protein 
Protein accessionYP_344001 
Protein GI77165476 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0383885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACACG ACGGCACCAT ACGCACCAAT GGGAAAGCGA TCATTGAAAC GGATAACTTG 
CCAATAAAAC TGTGGTTGGA AGAGGATCAA ATGGAAGAAG GGGCGCTGGA GCAGGCGCGA
AATCTTGCGA ATCTTCCATT TGCCTTCAAA CACATTGCTA TCATGCCTGA TACCCATCAA
GGCTACGGCA TGCCTATCGG TGCTATATTG GCCACCAAGG GGGCTATTAT ACCCAATGCT
GTCGGTGTGG ATATTGGGTG CGGCATGTGT TCCTTGCGGA CCAATCTCGA GCATATCGAA
ACGCCAAAGC TGAAAGAGAT CATGGGTATC ATCCGCAAGA CCGTTCCTGT GGGCTTTGAG
CATCACAAAA CGCGTCAAGA CGAAGCCTGG ATGCCTGAGA GAAAGGGGGA ATTACCCATT
GTTGAGCAAG AGTATGAAAG TGCCCTTTAT CAGATCGGTA CATTGGGCGG AGGCAATCAT
TTCATCGAAA TACAAAAGGG ATCGGATGGC TATATCTGGA TTATGATTCA CTCCGGCTCC
CGCAACATTG GTTTCACGGT GGCCAACCAT TACGAGGGCG TAGCGAAAAA GATGAACCAG
GACGCCGGCG AGGACGTGTC GCAGGAACTG GCATATATTC CCGAAACGTC TGAATATTTC
AAACTGTATT GGAACGAAAT GAACTATTGC CTCGAATTTG CACTGGCCAA CAGAAAACTG
ATGATGGAAC GGGCCAGGTC GGCGTTTACC GAGATTTTAC CCGAGGTCGA ATTCGCGGAT
TTTATCAATA AACCTCACAA CTTCGCGGCC GAGGAAAAAC ATTTTGGAGA GTGGGTCATC
GTCCATAGAA AAGGCGCGAC GCGAGCCCGA AAAGGAGAAT GGGGAATGAT CCCCGGCTCC
CAGGGCACAC GGTCTTTTCT CGTGAAAGGG AAAGGAGAAG CCCAGTCTTT CGAATCGTGC
GCGCACGGTG CCGGAAGAAT CATGAGCCGA ACAAAAGCGC GCAAAACACT GGATCTGAAG
GAAGAGGTAA AGGCCCTGAA AGACCGAGGA ATACTACACG CTATCCGCCA CCGCAAGGAT
CTGGATGAAG CGCCGGGATC TTACAAGGAC ATCGATGAGG TAATGGCAAA CCAGGTCGAT
CTGGTCGACG TGCAAATCGA GCTGCAGCCA CTGGCTGTCA TCAAGGGTTA A
 
Protein sequence
MKHDGTIRTN GKAIIETDNL PIKLWLEEDQ MEEGALEQAR NLANLPFAFK HIAIMPDTHQ 
GYGMPIGAIL ATKGAIIPNA VGVDIGCGMC SLRTNLEHIE TPKLKEIMGI IRKTVPVGFE
HHKTRQDEAW MPERKGELPI VEQEYESALY QIGTLGGGNH FIEIQKGSDG YIWIMIHSGS
RNIGFTVANH YEGVAKKMNQ DAGEDVSQEL AYIPETSEYF KLYWNEMNYC LEFALANRKL
MMERARSAFT EILPEVEFAD FINKPHNFAA EEKHFGEWVI VHRKGATRAR KGEWGMIPGS
QGTRSFLVKG KGEAQSFESC AHGAGRIMSR TKARKTLDLK EEVKALKDRG ILHAIRHRKD
LDEAPGSYKD IDEVMANQVD LVDVQIELQP LAVIKG