Gene Noc_2029 details

Gene Information

Locus tag: Noc_2029
Symbol:
ID: 3705180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2341122
End bp: 2342435
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 51%
IMG OID: 637738505
Product: hypothetical protein
Protein accession: YP_344020
Protein GI: 77165495
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03440] conserved hypothetical protein TIGR03440
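
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein; the record states neither explicitly, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2341122
end_bp = 2342435

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not
# contribute a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1314
print(protein_length)  # 437
```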


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATACCGA GCCAGGATGC AGTGGAGTTA AAAGAAATAG CGCCGGACAC TAGGGAGGAA 
GGGTTCACTT TTATGAGGCG CTACCGTCAG GTACGCCAGC TCAGTGAGAC CTTATGTCAG
CCTTTGGTGG ACGAGGACTA TGTGATTCAG ACTATGCCGG ATGTCAGCCC GCCCAAATGG
CATTTAGCCC ATAGCAGTTG GTTTTTTGAA AACTTTATTT TGATCCCTAA ATTCAAGGGC
TATCAACCCT TCCATCCGGC CTATAGTTAT TTGTTTAACT CTTACTATGA GACCGTGGGT
CAGTTCTGGC CTCGCCCGCA ACGAGGGCTG TTATCTCGTC CCACGGTAGC CGAGGTCTAT
GCCTATCGCC ACCATGTGGA TAAGAACATG GTGCGCCTAG CAGAGAATTT GGAAGCGGAG
AAGTGGCCGT CTGTTGCCTC CTTGATAGAA TTAGGACTTA ACCATGAGCA ACAACACCAA
GAATTGCTCT TGACTGATCT TAAACATATC TTTGCCACCA ACCCACTCCG TCCCGCCTAC
CAGGAGGGGG TTGTCCCTCA ATTCAAGGGA GCAAGAAAGA ATGGCAGCCT AGAATGGTAT
GACTACAAGG GAGGACTGCA TGCCTTGGGG TATTCCGGAG AAGGTTTTGC CTACGATAAT
GAAAGTCCTA ATCATCTTGT TTACTTGCGC GATTTTCGCC TCGCTTCGCG CCTCGTGACC
AACAGGGAAT ATCTAGCCTT TATGGCGGCG GGAGGATATC GAGAACCTCG CTACTGGCTT
TCTGAGGGTT GGCATACGGT GCGGCAAGAA GGTTGGCAGG CGCCATTGTA TTGGGAGCAG
CAGGGCGAAG GTTGGTGGCA AATGACTCTC CATGGGATGC AACCTGTTCA GAAAGAGGCT
CCCGTATGTC ATCTGAGCTA TTATGAAGCT GACGCCTATG CCCGCTGGGC AGGTTATCGG
CTGCCTACAG AGGCTGAATG GGAAATCGTG GCGCGAACGC TGCCATGTCG GGGTAATTTT
TTGGAGTCAG GGGCCTTACA GCCCTTACCC GCGCCTCCAG CAGCCCCTAC CCCGGTTCAA
ATGTTTGGGG ATGTCTGGGA ATGGACCGGG AGTCCCTATG CGCCCTATCC TGGTTATCAG
CCTTCCGAGG GAGCTATTGG TGAATACAAT GGAAAATTTA TGTGTAATCA AATGGTTTTA
CGAGGTGGTT CCTGTATCAG CTCATCTGAG CATTTGCGTG CTTCCTATCG CAATTTTTTC
CCTCCCCACG CCCGTTGGCA ATTTACGGGC CTTCGATTAG CGGATGATGT ATGA
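
A GC content figure such as the one reported in the Gene Information fields can be recomputed from a sequence laid out like the one above. The sketch below is a minimal helper, not the database's own method: `gc_percent` is a hypothetical name, and it assumes that only the characters A, C, G and T count as bases, so the spaces and line breaks of the 10-base block layout are ignored.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq."""
    # Keep only nucleotide letters; this drops the spaces and newlines
    # used to lay the sequence out in blocks of ten.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 2 G and 2 C out of 10 bases.
print(gc_percent("ATGATACCGA"))  # 40.0
```

Passing the full gene sequence as a single string gives the value to compare against the reported GC content field.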
 
Protein sequence
MIPSQDAVEL KEIAPDTREE GFTFMRRYRQ VRQLSETLCQ PLVDEDYVIQ TMPDVSPPKW 
HLAHSSWFFE NFILIPKFKG YQPFHPAYSY LFNSYYETVG QFWPRPQRGL LSRPTVAEVY
AYRHHVDKNM VRLAENLEAE KWPSVASLIE LGLNHEQQHQ ELLLTDLKHI FATNPLRPAY
QEGVVPQFKG ARKNGSLEWY DYKGGLHALG YSGEGFAYDN ESPNHLVYLR DFRLASRLVT
NREYLAFMAA GGYREPRYWL SEGWHTVRQE GWQAPLYWEQ QGEGWWQMTL HGMQPVQKEA
PVCHLSYYEA DAYARWAGYR LPTEAEWEIV ARTLPCRGNF LESGALQPLP APPAAPTPVQ
MFGDVWEWTG SPYAPYPGYQ PSEGAIGEYN GKFMCNQMVL RGGSCISSSE HLRASYRNFF
PPHARWQFTG LRLADDV
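
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch rather than the database's own pipeline. Translation table 11 assigns the same amino acids to sense codons as the standard genetic code and differs in which codons may act as start codons; the sketch ignores alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks of the 10-base block layout.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Raises KeyError for codons with characters other than A/C/G/T.
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGATACCGA GCCAGGATGC AGTGGAGTTA"))  # MIPSQDAVEL
# The last 24 bases, which are in frame, give the C-terminal residues.
print(translate("CTTCGATTAG CGGATGATGT ATGA"))        # LRLADDV
```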