Gene Noc_1703 details

Gene Information

Locus tag: Noc_1703
Symbol:
ID: 3704625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1906186
End bp: 1907292
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 50%
IMG OID: 637738184
Product: PAS sensor, signal transduction histidine kinase
Protein accession: YP_343705
Protein GI: 77165180
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
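
The coordinates and lengths listed for this gene can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the gene record.
start_bp = 1906186
end_bp = 1907292
gene_length_bp = 1107
protein_length_aa = 368


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 1907292 - 1906186 + 1 gives 1107 bp, and 1107 / 3 - 1 gives 368 aa, which agrees with the listed gene and protein lengths.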


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.301457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAGGG AAATTCAGCT GCCTTTCGAG CGGCTATTTG ACACTGCCCC CGATGCCATG 
GTGATCTCTG ATCATCAGGG CCGCATCGTC CTCGTTAACG CCATGGCTGA ACGGATGCTT
GGCTATTCGC GAGCAGAACT AATAAGCCAG CCTCTTGAGA TTCTGGTACC AAAGCAACAC
CGCGATGGCC ACGCCCGCCA GCGTCAAAAG TATTATCGTC ACCCCCGCAC TCGCCCCATG
GGCGAAGGTC GTGAACTCTA TGCAGTACGC AAGGACGGCA GCATGTTTTC GGCCGAGATC
AGCCTCAGCC CCATGGAAGT AGATGGCCGC TTATTAATTA CGAGCGCTAT CCGAGATATT
ACGGAACGCA AACAAATACA GAAAACACTA GAACAGCAAA CCCAGGATCT CATGCGCTCT
AATGCCGAAT TAGAACAATT CGCCTATGTG GCTTCCCATG ACCTCCAGGA GCCGCTTCGA
ATGGTCAGCA GTTATGCCCA GTTGCTCGCC CGTCGCTACC GGAATCAATT AGACTCCGAT
GCCGATGAAT TCATCGAGTT CATGGTGGAT GGGGCCACTC GAATGCAGGC ACTCATCAAT
GACTTGCTCG CTTATTCCCG CGCAGGCACC AAAAATAGAA CTTTTGCCAC AACCAATAGT
AACGGGGTAG TCCGGCAGGT TCTGGAAAGC CTCCAATTTG TAGTTAAAGA AACTCAGGCC
TCCCTGACTG TTGATCCTTT GCCCCTGTTA ATAGCAGATG AAGCCCAACT CGCACAACTA
TTCCAGAACC TCATCAGTAA TGCCTTAAAA TTTCGGGGAG AAACGATACC GAGAATCCAT
ATTAGCGCTA AAGAGGAAGA GAATGAAATT ATCTTTTCCA TAGCTGATAA CGGGATTGGG
ATTGAACCTC AATACGCCGA ACGAATTTTT TTACTTTTTC AGCGCCTGCA TAGCAAAAGG
GAATATCCCG GCACAGGTAT TGGCCTCGCC ATTTGCAAGA AGATCGTGGA ATGTCACGGG
GGGCGGATTT GGGTAGAATC CAAGCAGGGC AGGGGGGCTA CGTTTTTCTT CACCTTGCCA
TTCAAACCAG AAAAACCCCT ACCATAA
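
The gene sequence is laid out in blocks of 10 bases, so it needs to be cleaned before any computation such as GC content. The following is a minimal sketch, assuming the sequence is pasted in as plain text; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGACAAGGG AAATTCAGCT GCCTTTCGAG CGGCTATTTG ACACTGCCCC CGATGCCATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same functions on the full sequence should give a value comparable to the GC content listed for the gene, although the record does not say how that figure was rounded.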
 
Protein sequence
MTREIQLPFE RLFDTAPDAM VISDHQGRIV LVNAMAERML GYSRAELISQ PLEILVPKQH 
RDGHARQRQK YYRHPRTRPM GEGRELYAVR KDGSMFSAEI SLSPMEVDGR LLITSAIRDI
TERKQIQKTL EQQTQDLMRS NAELEQFAYV ASHDLQEPLR MVSSYAQLLA RRYRNQLDSD
ADEFIEFMVD GATRMQALIN DLLAYSRAGT KNRTFATTNS NGVVRQVLES LQFVVKETQA
SLTVDPLPLL IADEAQLAQL FQNLISNALK FRGETIPRIH ISAKEEENEI IFSIADNGIG
IEPQYAERIF LLFQRLHSKR EYPGTGIGLA ICKKIVECHG GRIWVESKQG RGATFFFTLP
FKPEKPLP
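
The protein sequence can be checked against the gene sequence by translating codons. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The compact table layout and the `translate` helper are illustrative, not taken from the source database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a block-formatted DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 27 bases of the gene sequence, copied as displayed.
assert translate("ATGACAAGGG AAATTCAGCT GCCTTTCGAG") == "MTREIQLPFE"
assert translate("TTCAAACCAG AAAAACCCCT ACCATAA") == "FKPEKPLP*"
```

Both ends of the translated gene match the start (MTREIQLPFE) and end (FKPEKPLP) of the protein sequence, with the final TAA codon read as a stop.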