Gene Noc_1898 details


Gene Information

Locus tag: Noc_1898
Symbol:
ID: 3705491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2166124
End bp: 2167851
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 50%
IMG OID: 637738376
Product: hypothetical protein
Protein accession: YP_343893
Protein GI: 77165368
COG category:
COG ID:
TIGRFAM ID:
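
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def coordinate_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2166124, 2167851
gene_length_bp, protein_length_aa = 1728, 575

print(coordinate_span(start_bp, end_bp) == gene_length_bp)           # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 2167851 - 2166124 + 1 = 1728 bp, and 1728 / 3 = 576 codons, of which one is the stop codon, leaving 575 residues.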


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.487361
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCA AATGTATATT TTGGAAAAAC GTAACATTCG CGCTAACTAT ATTAACACCG 
CTGCCTTTAC TGGCAGAAGC AACTTCAGCG CCGGTAGCTA TAGATCAGGG TGCTGAGTGG
ACAGCTTCTG CTCAAAAAGA TTTTTATAGC CGTGATCAAG GCTCGCGGAT CATGCCGCTT
CGCTGGATGG CTGCGTTGAA ACAGCCAAAT GGCGAACCTT TCATGGCGGC GAGTCTTAGT
CGGTATGGTT ATCTGCCGAA CGAGGACAGC AACCCCCCCG GCCTGCCGGT GGGCTTTACC
GTCGCCAGCG GTAGCGATGG CCAATATATC GGCATGACCT GTGCTGCATG TCACACACGG
CAGATCGAGG TAGCAGGGAC TTTTTATCGG ATCGATGGTG GCCCGGCTAT TGCTGATTTC
CAGAGTTTCC TGGCCGATCT CGATACTGCG GTAAATACTA TCCTTACCGA TCAACAAGCC
TTTAAGAATT TCGCCCATGC GGTACTTGGC CCGTCGCCAA CGACAAGCGA AGAGAATAAG
CTGCATGAGG CAGTGCAAAC TTGGTATTTG CCTTATCACA CGCTGATGGA AGGTGCGTTG
CCACCCTCGC CCTGGGGACC AGCGCGTCTC GATGCCGTAT CCATGATTTT TAACCGACTT
ACCGGCCTGG ATATTGGCCC CCCTCCTACT TACATGATTC CAGAAAATAT TAAGCCTGCC
ACGGCACCGG TACGATATCC TTTTCTTTGG AACGCGGCAA TCCAGGATAA GACACAGTGG
CCTGGTTTCG CTGACAATGG CAACAATATT CTGGGGCTCG CGCGTAATCT TGGAGAAGTC
TACGGAGTCT TCGGCGTTTT TCATCCTAAA AAGGATAAGT GGCGACTGCT TGGGATTAAT
TACCTAGCCA ATAATTCAGC TAATTTCCAA GGCTTAAATG CATTGGAGAA TCTGGTGCGA
AAAATTGGCC CGCCGAAATG GCCATGGGAA GTAGATCAAG CTCTTGCTAG CAAGGGCAAG
GAAGTTTTTG AGCGTAAGGC TGAACAGGGC GGTTGTATTG GCTGTCATGG GATCAAACCT
GGGGAAACCC GCTTCTTGAA CCAAAAAACC TGGGCCACGC CGATTCAAGA TGTCGGTACG
GACTCCAAAG AATATGAAAT CCTTGGCTGG ACTGTTAAGA CCGGCGTGCT TGAAGGCGCG
AAAATTCCTT TCCTTGCCGA ACCGCTTAAA CCTGTTGACA CAGCCTTCAA TGTACTGGGA
ACATCCGTCA TCGGCTCTAT CCTCCAGCAT TACGTTCCAG TTTTGATGAA GTCAGAAGAA
CATGCTAAGA CCGAGGGTAA GCGCCCACTG TTCACACCGG AGACCGAAGA TCTCAAAGGC
GCGTTTAGAA TGCCAACGTT GGCTACCGCT ACGCCTACCT ATGCTTACGA ATCGCGGGTA
CTTCAAGGAA TATGGGCAGC CGCTCCATAC CTCCACAATG GATCGGTGCC AACACTAGCT
GAGTTACTAA AACCAGCAGC CGAACGGGTT CGTTCATTCA AAGTAGGCCC AGCTTATGAT
CTGGTTGATA TCGGACTTGC TGTCGAGCAA ACCCAGTTTG ACTATACTTT AGAGACTACC
GATTGCAGTG ATCGCAACTC AGGAAATAGT CGCTGTGGCC ATGAATTTGG TACCCAACTT
TCAGCGGACG AGAAAAAGGC GCTGCTTGAA TACCTTAAAA TTCTTTAA
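
The gene sequence above is printed in blocks of ten bases separated by spaces, so the whitespace has to be removed before the sequence is measured. The following is a minimal sketch of how the length and GC fraction could be recomputed from such text; `clean_sequence` and `gc_fraction` are hypothetical helper names, and how the record rounds its reported GC content is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence, used as a short example.
example = "ATGAAAATCA AATGTATATT"
print(len(clean_sequence(example)))  # 20
print(gc_fraction(example))          # 0.15
```

Pasting the full gene sequence into `gc_fraction` in place of `example` gives a value that can be compared with the GC content field of the record.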
 
Protein sequence
MKIKCIFWKN VTFALTILTP LPLLAEATSA PVAIDQGAEW TASAQKDFYS RDQGSRIMPL 
RWMAALKQPN GEPFMAASLS RYGYLPNEDS NPPGLPVGFT VASGSDGQYI GMTCAACHTR
QIEVAGTFYR IDGGPAIADF QSFLADLDTA VNTILTDQQA FKNFAHAVLG PSPTTSEENK
LHEAVQTWYL PYHTLMEGAL PPSPWGPARL DAVSMIFNRL TGLDIGPPPT YMIPENIKPA
TAPVRYPFLW NAAIQDKTQW PGFADNGNNI LGLARNLGEV YGVFGVFHPK KDKWRLLGIN
YLANNSANFQ GLNALENLVR KIGPPKWPWE VDQALASKGK EVFERKAEQG GCIGCHGIKP
GETRFLNQKT WATPIQDVGT DSKEYEILGW TVKTGVLEGA KIPFLAEPLK PVDTAFNVLG
TSVIGSILQH YVPVLMKSEE HAKTEGKRPL FTPETEDLKG AFRMPTLATA TPTYAYESRV
LQGIWAAAPY LHNGSVPTLA ELLKPAAERV RSFKVGPAYD LVDIGLAVEQ TQFDYTLETT
DCSDRNSGNS RCGHEFGTQL SADEKKALLE YLKIL
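
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is an illustration under stated assumptions, not the method used to produce the record: it uses the standard codon assignments, which translation table 11 shares for all non-initiator codons, and it does not handle the alternative start codons that table 11 allows (the gene here starts with ATG, so that case does not arise). `translate` is a hypothetical helper and raises `KeyError` on any character other than A, C, G, or T.

```python
# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAAAATCA AATGTATATT TTGGAAAAAC"))  # MKIKCIFWKN
print(translate("TACCTTAAAA TTCTTTAA"))               # YLKIL
```

Both fragments match the corresponding ends of the protein sequence listed above, which begins MKIKCIFWKN and ends YLKIL.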