Gene Noc_1997 details


Gene Information

Locus tag: Noc_1997
Symbol:
ID: 3704881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2299407
End bp: 2300765
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 51%
IMG OID: 637738473
Product: sigma-54 specific Fis family two component transcriptional regulator
Protein accession: YP_343989
Protein GI: 77165464
COG category: [T] Signal transduction mechanisms
COG ID: [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID: [TIGR02915] putative PEP-CTERM system response regulator
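
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2299407
end_bp = 2300765
gene_length_bp = 1359
protein_length_aa = 452

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span_bp // 3
coded_residues = codons - 1

print(span_bp == gene_length_bp)            # coordinate span vs. listed gene length
print(coded_residues == protein_length_aa)  # codon count vs. listed protein length
```

Under these assumptions both comparisons hold for the values listed here.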


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAGTA AGAAAAACTT GCTGATCGTG GAAGATGATC TTGGCTTGCA GGGCCAGTTG 
CGGTGGGCTT TTTGTGGTTA TGAAATAGCA GTAGCTAAGG ATCGTCAGGA AGCGGTTGCA
TTAGTACGCC GCCATGAACC TCCGGTAGTA ACGTTAGATC TTGGTTTGCC TCCTAACCCT
GGTGGCGTCA GCGAAGGCAT GGCCTCCTTG CAGGAAATCC TGGCTCTGGC GCCCTATACC
AAAATCATTG TGATCACAGG GAATGACAGC CAGGAGCATG CGGTGCAAGC AGTGGGGGCG
GGTGCCTACG ATTTTTATTC GAAGCCCATC GATCCCGACA TTCTTAAATT GACGATTGAT
CGGGCTTACC GGCTTTATGA ATTGGAGATG GAAAATCGGC GTTTACGGCG GGCTGGTCAT
TCCTCCCTGG AAGGGGTGAT TGCCGTCAGC CCTGGAATGA AGAAAATATG TCGTACCATT
GAGAAGATCG CACCCGCTGA TGTCACTGCG TTTATTTTGG GTGAGAGCGG TACAGGCAAG
GAAGTTATTG CTCGGGCCTT GCACCAACTG AGCTATCGCC GAGAGCAAAC TTTCGTGGCT
ATTAACTGCG CGGCTATTCC GGAGAACCTT TTGGAAAGCG AGCTTTTTGG CCATGAAAAA
GGAGCCTTTA CGGGGGCTGT GAGACAAACC CGGGGCAAGA TTGAATACGC TCATGAGGGT
ACTTTGTTCT TAGATGAAAT CGGGGATTTG CCCCGGGGCC TCCAGGCCAA GCTGCTGCGT
TTTCTGCAAG AACGGGTGAT TGAGCGGGTG GGTGGCCGTG AAGAAATCCC TGTGGATGTG
CGGGTTATCT GCGCGACCAA CCAGGATTTA AAAGAGCTTA TTGCTCAAAA CCAATTTCGG
GAAGATCTAT ATTATCGAAT TGCCGAAGTT ACCGTGACTC TTCCGCCTTT GCGGGAACGC
CCGGGCGATG CGGTAGTCAT TGGGCGGGCG CTGCTTGAGC ATTTTTCGCG CACGCAGGGT
AAAGCGGTCC GCGGTTTTAC AGATGATGCT ATTAGAGCTA TTGAGACTCA TACCTGGCCT
GGCAATGTTA GAGAACTAGA AAATTGCATA AAACGGGCTG TGATAATGGT GGAAGGCAAC
CGTATTGCGT CGGAAGATTT AGACTTGCCC GCCTCTGCCT CCCCCGAACA GCAATCTTTG
TCCTTAAATT TGCGTCAAAT ACGGGAGCAT ACTGAACGGG AGGCGCTTAC CCGCGCCATC
ACCTTGGTGA ATGGCAATCT CTCACGGGCA GCGGAACTCT TAGGGGTGAC TCGTCCTACT
TTATACGCAT TATTAGATAA ATATGAAATG CGTGGTTAA
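
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and defining GC content as (G + C) divided by total length is an assumption about how the record's GC figure was derived.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGCAGTA AGAAAAACTT GCTGATCGTG GAAGATGATC TTGGCTTGCA GGGCCAGTTG"
seq = clean_sequence(first_line)

print(len(seq))                       # number of bases on the first line
print(round(100 * gc_fraction(seq)))  # GC percentage of this line only
```

Pasting the full sequence into `clean_sequence` would give the gene-wide value to compare against the GC content field.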
 
Protein sequence
MSSKKNLLIV EDDLGLQGQL RWAFCGYEIA VAKDRQEAVA LVRRHEPPVV TLDLGLPPNP 
GGVSEGMASL QEILALAPYT KIIVITGNDS QEHAVQAVGA GAYDFYSKPI DPDILKLTID
RAYRLYELEM ENRRLRRAGH SSLEGVIAVS PGMKKICRTI EKIAPADVTA FILGESGTGK
EVIARALHQL SYRREQTFVA INCAAIPENL LESELFGHEK GAFTGAVRQT RGKIEYAHEG
TLFLDEIGDL PRGLQAKLLR FLQERVIERV GGREEIPVDV RVICATNQDL KELIAQNQFR
EDLYYRIAEV TVTLPPLRER PGDAVVIGRA LLEHFSRTQG KAVRGFTDDA IRAIETHTWP
GNVRELENCI KRAVIMVEGN RIASEDLDLP ASASPEQQSL SLNLRQIREH TEREALTRAI
TLVNGNLSRA AELLGVTRPT LYALLDKYEM RG
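
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact NCBI-style amino acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs only in permitted start codons. The function name `translate` is hypothetical, and the sketch does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and the final three codons, copied from the record.
print(translate("ATGAGCAGTA AGAAAAACTT GCTGATCGTG GAAGATGATC TTGGCTTGCA GGGCCAGTTG"))
# prints MSSKKNLLIVEDDLGLQGQL
print(translate("CGTGGTTAA"))
# prints RG
```

The two outputs match the first 20 and last 2 residues of the protein sequence listed above.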