Gene Noc_1356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1356 
Symbol 
ID3706120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1502103 
End bp1503422 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content50% 
IMG OID637737851 
Producthypothetical protein 
Protein accessionYP_343380 
Protein GI77164855 
COG category[R] General function prediction only 
COG ID[COG4287] PhoPQ-activated pathogenicity-related protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.440655 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTATCTC CTTTTTTTGG TCGATCTTCG AGAGCGGTTT TTTTGATTAT AGCGCTCTTT 
TTTATCACTA AGCCGGCTTT TGCGGGCGCG CTTGAAGATT ATGTTCGCAA GCCTGATCCT
CATTACAATT GGAAACTGAC GGAGCAGAAA GAAGAGCACT GGGGAACTAT GGCTTATCTG
GAATTAGTGT CCCAGCATTG GCGCAATCAA TTTTGGAGTC ATCGTCTCAT TATAGCTCAG
CCCAAGGAAG TCCGGAATCC AGAAATCGGT TTACTTTTGA TCGCGGGTGA GGGAGACGGG
GAGAAGTATA TTGAGCGGTT GAAAATGCTC GCCCAGCGCG CAGGCGCGGT GGCCGCCGTT
ATCACTCAAG TTCCCAATCA GCCGCTCTAT AATGGCCTCA AGGAGGATGC CTTGATTGCC
TTTACTTTGG CTCAGTTTTT AAAAACTGGC GATGAAACGT GGCCATTGCT GTTTCCCATG
GTGAAGAGTG CGGTTCGGGG TATGGATACC CTTCAGGCAT TCCTGGAGCG GGCATTTCAG
CAGAAAATTG AGGGCTTTGT AGTGGCGGGG GCCTCCAAGC GAGGCTGGAC CACTTGGCTA
ACCGGTGCGG TAGATTCGCG GATAAAAGGG TTGGCTCCGA TGGTCATTGA TATGCTGAAT
ATGGAGCAGC AACTGCATTG GGCTGAAAAA GCGTATGGCA GGCAAAGCGA AAAAATTAAC
GATTATACGG AGCTTAGCCT TCATCAAAAT CAAGATGATC CGGCCGTGGC AAAACTGCGT
AGTTGGATAG ATCCCTATGA ATACCGACAG CACTATACGA TGCCTAAACT TTTGCTCTTA
GGCACGAATG ACCCTTATTG GGTGGTGGAC TCGCTGCGGC ACTATTGGAA CGAGTTGCCG
GCGCCTAAGT TAATTTTTCA GACGCCTAAC GCAGGGCACG ATCTGAACGG CGGTAAACAA
GCGATGCAGA CCTTGGCCGC ATTTTTTCAA ATGATTGCTG ATGGTCAGGA TCTGCCCCAA
TTAGAATGGG AGTTACCAGC TAGCGACGCG GGAGAGCCAA GTGTTAAGGT AACGAGCGGG
CAATCGGTTC GGGCAATTCG ACTTTGGACG GCCACATCAG AGGATCGAGA TTTTCGTGAT
GAACATTGGT CGAGCCGCAG TCTGAAAATC CTGCCAGGCA GCAGGCACGC AATAGCCAAA
GTGGTTATTC CAGAACAGGG ATATCGCGCC TATCTGTTTG AAGTAGAGAT GACCACATCG
ACAGGGCATC CTTATAAACT TTCCACGGAA GCCCGTGTCC TACCCGATGA TATTAAATGA
 
Protein sequence
MLSPFFGRSS RAVFLIIALF FITKPAFAGA LEDYVRKPDP HYNWKLTEQK EEHWGTMAYL 
ELVSQHWRNQ FWSHRLIIAQ PKEVRNPEIG LLLIAGEGDG EKYIERLKML AQRAGAVAAV
ITQVPNQPLY NGLKEDALIA FTLAQFLKTG DETWPLLFPM VKSAVRGMDT LQAFLERAFQ
QKIEGFVVAG ASKRGWTTWL TGAVDSRIKG LAPMVIDMLN MEQQLHWAEK AYGRQSEKIN
DYTELSLHQN QDDPAVAKLR SWIDPYEYRQ HYTMPKLLLL GTNDPYWVVD SLRHYWNELP
APKLIFQTPN AGHDLNGGKQ AMQTLAAFFQ MIADGQDLPQ LEWELPASDA GEPSVKVTSG
QSVRAIRLWT ATSEDRDFRD EHWSSRSLKI LPGSRHAIAK VVIPEQGYRA YLFEVEMTTS
TGHPYKLSTE ARVLPDDIK