Gene Noc_1251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1251 
Symbol 
ID3706363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1372390 
End bp1375767 
Gene Length3378 bp 
Protein Length1125 aa 
Translation table11 
GC content52% 
IMG OID637737753 
Producthypothetical protein 
Protein accessionYP_343282 
Protein GI77164757 
COG category[S] Function unknown 
COG ID[COG3002] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCAC ACCACGCCGA ACAGGCGGAA TCGCCCTTAG ATAACCGAGA AAAACTGACT 
ACCGCGGTAG ACAAGTTAGA ACTCGTGCTG CCATGCGCAG CCCCCCTGTA TGACTTCATG
CACTTAAACA CCATGCAGGG CTACCACCAT ATCTCCTTCG CCGAAGCCAT GGCGGCTCAC
TTTGAGCTGA CTGGCATCCG AGGTTACTTG CCTGAAGAAG ACTTCCGTAA GCATTATGCC
CGGGGCCGTA TTGACGATGC TGACCTTAAC GAATCCCTAG CAAACAATAC CCATGGCCGT
AATCGCGAAA TTGTACTGAA AGTTAGTGGC CGTCCTATCA ATAAACAGGA CATTTGGCGT
ATCAGCCTAA TCCAGGACAT TAACCCTCTT TCTCCCAGCC GTTTTCGTTG GCAAATCAAG
GAATATGATG CTCTGGAGCG TTTTCAGAAC GGCGTGCCAA AATCTGCCCG CGACACACTG
CTTAACGCTA CCCAAGAGAC AGATCAAAAC CGCCGTACGG AAAGCCAGGC CATCCGAGAT
CTATGGGAAG CATGTTTAAG GGTTTTCCAG TTAGAAAATC CTAATCTGCA TTCGGAAGAG
TTAGGGGAAC TGGTAGACCT AGAAGAATTT TCCCGCAGCC AGGCCGAAGG TAAACAGAGT
AAATTCCGCA CAAGCCAACC CACGCTTATC CCCCAAGAAG AAATGCTTGC TGAAGCACGA
AAAGATCTTC ATCATCTAGT CGATAAGGTA GGTGAGGAAC TCTCCCTGCG TGGTCTATTA
CAAGCACTGA CTGGTGAGGA TCTTTTAGAT CAGGTTCGTC CCATCTTGAT TCGCTTCTGT
GCTTCCCATC TTGATGAAGG CTTCACAGCC TGGAGTCTAC CGGAGCGAGG GCAAGGGCTC
TATGCCGCAT GGCGAAAGTG CCCCTTCGCT GAACTCGGAC TAGACTTGGC TAGGCTACCG
GATTGGCAGT CGTTCCATGC CGAGTTACCA GAGCACTCAG TAGATGCCGT TATTGCCTGC
CTTGAGCGAT TGAAAATCCC AGAATCCCGT TGGGAAGGTT ACCTGAAACG GATAGCTGTG
GAACTTCCAG GCTGGTCCGG TCTTATTAAC TGGCGCCACC ATCGGCCCAA ATATAAGCCG
AATCGGAAGG CCCCTACTTC GCTTATGGAT TATCTGGCTA TCCGCCTCTT CTTGGATGTG
ATTCATATTG AACAGGTTAC CCAAAACACC TGGGGTATCG CCGGTAATCT AGAGGAACTC
AAAACTTACT TCGAGAATTA CCTTTGGGAG TTCTCTGGGC GGTATGCGCT ATTTTCAAAC
ACGCTCCCAG AATATTTGGC CATCAGAGCA CAAGAACTGA TAGCTCTGCC CAGAACTGCC
CAAAAGGATC GGGAAAATTG GCGCACGGTC TCCAATATGA TTCATAACTG GAAGCACAAT
CCTTCGGCGG AAAGGACAAA ACGTCAAACC GTGCATTCTC ATGTCTGGCG TTTATTCTGC
CTAGCACAGC ATCTAGGTCT GCCGGGAAAT GAGGTGGGCA AACTTTCCTC CAGCGAGGCT
GAACAACTGC TCGCCATCCT GGATGAATTG ACCACCTCAG AACGGGGGTA TATCTGGCTC
TGTGCTTATG AGTACCATTA CCGGGAGGAT TATTTCGCTG CGCTGACTCA GAATCATGGG
CGTGGCCGCT GGGCCAACCG GAATGAACGC CCAGAGGCAC AACTGATCTT CTGTTTCGAC
GACCGGGAGG AAGGTATTCG CCGCCATTTA GAGGAAGTCA ACCCTAACCT TGAAACCTTG
GGGGCGCCAG GCTTTTTCGG TGTACCCATT CAGTGGCGCG GGTTAGATTA CCCTGATACC
ACCCCCCATT GTCCGGTTGT GGTAACCCCT GTTAATGAAC TTCATGAAGA ACCTCGTCCG
GAAGCAAAAA AACGGTACGG CCTCCATAAA CGCCTCTATA ACTTCAAGCA ATTTCTACTT
CGCGTACTTC ATAATAAAAC CCGCCGCGAC CTATTGACTT CAAAGGTATT AATCGATGTC
CTCTTCCCTG GAATGCTAGC TGTGCTAGCA GGCAAGGTTT TCTTTCCCTT CCAGCAAGCA
TCGCTAAAAC GCAAAGCTAC CGCTGCGCTT GTGCCACCCG TACCGACCCA ATTGAAACTT
ACTGTACCGG ACGACGGCAC GGAAGCCACA CCCGATAATC TACGTGTAGG ATTCACCGAT
GCTGAGCAAG CTGAACGATT AGCAGCCTTT CTACGCGCTA TCGGGTTTAC CTCTGGCTTT
GCGCCGCTGG TGGTGCTATC CGGGCATGGC TCCATGAGCG AAAATAACCC CCAGCTAGCT
TCTTATGACT GCGGCGCCTC CGGCGGGCGC CATGGCGGAC CCAATGCCCG CGCTTTTGCG
GCCATGGCTA ACCGGCCGGA AATACGGGCA CGGCTCGCTG AACAAGGTAT CCACATTCCT
GACGACACCT GGTTCATCGG CACGGAACAT GATACTTGTT CCGAATCGTT CCCCTGGTTC
GATCTGGATA AAGTGCCCGC TAATTTCGCG CCCGCCCTGA AGAAATTGAA AGCCGAAGTT
GATCAGGCCC TCCTGCTGTC AGCCCATGAA CGTTGTCGGC GCATGGCTTC GGCACCCCGT
AAACCTAGTC TACAGCAGGC CAGGAGACAT GTTGCCGAAC GTGGCACGGA CTTCAGCCAG
GCCCGGCCAG AGCTAGGCCA TGCCACGGTT GCCTCGGCGC TTATCGGGCG CCGTTCCGTC
ACCCGGGGGA TATTTCTAGA CCGCCGCTGC TTCGTGCTTT CTTATGATCC AACTATTGAT
GACGCAGAAG GCACTATTCT TGAAGGCGTC CTTAAGAACG CTGGCCCGGT AGGCGTAGGT
ATTAATCTGG ATTATTACTT TTCGGCCGCC AATAACCAGG GATTTGGTAG TGGCTCAAAG
GTAGCCCAAA ACGTAACCGG TCTATTTGGC GTCATGCAGG GTATTGACGA TGATTTACGC
ACCTGGTGTT CCTATCAAAT GGTTGATGTT CATGAGCCCA TGCGCGTTCT AACCGTAGTG
GAAGCAACCA CGGAAACCCT GACCGCTATC TACAAGCGCC AACCCTCTTC CCAAGAGCCC
GCCGGAGGTA GCTGGTTGCT GCCACCTTTG CGCCAGCTCA TTGATGGCGG TTGGCTGCTG
TTAGCAGCTA TCCACCCAAA AACCGGCAAA ATTTCGGTGT TCGATCCTAA GCAAGGATTT
ATTCCCTGGA AAAGCTATCG GGAACCCTCA CCCTTGCCGG TGGTGGAGCG CTCCATGGAC
TGGTATGACG GTTATAGCGA TCCTAGGCCA CCTGCGCTAG TTGAACCCAA ACAGACGGAG
ACCCATCATG CTGCCTGA
 
Protein sequence
MSAHHAEQAE SPLDNREKLT TAVDKLELVL PCAAPLYDFM HLNTMQGYHH ISFAEAMAAH 
FELTGIRGYL PEEDFRKHYA RGRIDDADLN ESLANNTHGR NREIVLKVSG RPINKQDIWR
ISLIQDINPL SPSRFRWQIK EYDALERFQN GVPKSARDTL LNATQETDQN RRTESQAIRD
LWEACLRVFQ LENPNLHSEE LGELVDLEEF SRSQAEGKQS KFRTSQPTLI PQEEMLAEAR
KDLHHLVDKV GEELSLRGLL QALTGEDLLD QVRPILIRFC ASHLDEGFTA WSLPERGQGL
YAAWRKCPFA ELGLDLARLP DWQSFHAELP EHSVDAVIAC LERLKIPESR WEGYLKRIAV
ELPGWSGLIN WRHHRPKYKP NRKAPTSLMD YLAIRLFLDV IHIEQVTQNT WGIAGNLEEL
KTYFENYLWE FSGRYALFSN TLPEYLAIRA QELIALPRTA QKDRENWRTV SNMIHNWKHN
PSAERTKRQT VHSHVWRLFC LAQHLGLPGN EVGKLSSSEA EQLLAILDEL TTSERGYIWL
CAYEYHYRED YFAALTQNHG RGRWANRNER PEAQLIFCFD DREEGIRRHL EEVNPNLETL
GAPGFFGVPI QWRGLDYPDT TPHCPVVVTP VNELHEEPRP EAKKRYGLHK RLYNFKQFLL
RVLHNKTRRD LLTSKVLIDV LFPGMLAVLA GKVFFPFQQA SLKRKATAAL VPPVPTQLKL
TVPDDGTEAT PDNLRVGFTD AEQAERLAAF LRAIGFTSGF APLVVLSGHG SMSENNPQLA
SYDCGASGGR HGGPNARAFA AMANRPEIRA RLAEQGIHIP DDTWFIGTEH DTCSESFPWF
DLDKVPANFA PALKKLKAEV DQALLLSAHE RCRRMASAPR KPSLQQARRH VAERGTDFSQ
ARPELGHATV ASALIGRRSV TRGIFLDRRC FVLSYDPTID DAEGTILEGV LKNAGPVGVG
INLDYYFSAA NNQGFGSGSK VAQNVTGLFG VMQGIDDDLR TWCSYQMVDV HEPMRVLTVV
EATTETLTAI YKRQPSSQEP AGGSWLLPPL RQLIDGGWLL LAAIHPKTGK ISVFDPKQGF
IPWKSYREPS PLPVVERSMD WYDGYSDPRP PALVEPKQTE THHAA