Gene Nwi_1754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1754 
Symbol 
ID3676477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp1923605 
End bp1925221 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content60% 
IMG OID637713314 
Productbacteriophytochrome 
Protein accessionYP_318367 
Protein GI75675946 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.478191 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.160792 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCGCTG AGGGGTTTAG CAAAAGGCCG GTAAACGCGC GGGGGCGATT CAGACTGTCT 
CGATCCGAAT ACGATGAAAT CGTAGCGCCA AGAGCCAGGA CGTCACGACC CGCGCTCAGC
CAGTTGCTCC TGCTGACGGC AGGTCTGGTG GTTCTGACGC TGATCAGCGT CATTTCCATT
TATCTTGTCA ACAAGGCGCG CGACGACACT CGCTGGGTTC TCCGCAGCAT GGAAGTCGAG
AATGAAATTT CACTGGCCCA GTTACAGATA CAGCGCGCGG AAAGTTCGCA GCGCAGCTTT
CTGCTGACGC AGCGCCCGGA ATTCCGGGCT GCGTTCGAAG ATGCATCAGG CCGGATTCCG
CTGAGCTTCG AACGGCTGAG GGTACTGACC AAAGAGAATC CCTACCAGCA ACAGCGGCTC
CAGCAAATGG TCCCGCTGAC CAGTCAGCGG ATCGAGCAGT TGCGGCAGAG CATTGATCTC
GCGCTGACGA ACCGGCTGGA TCAGGCCGTT GAGAGCGCGC GACGAGAGGC TGGTCAGGAC
GCGACGCGAC AGATTCGCGA TCTGGGGAAC GAGATGCGAA CCGAGGCAAA TCGTCTGCTC
GCGGCGCGGA CGGCGTCAGC CGATGCCAGC CAATCGCGTT CGGCGCTGAT TACCAGCGTC
GGCTCGGGGC TCGTTGTGCT GTTTGCCGGG CTCTCGATCT ACCTTGTGCG GCGGTCGAAC
CGGGAACGCG ACCTGGCTAA CGCCCGGCTG CGCGATATCA ATCTCAATCT GGAATCGACC
ATCGAGAAGC GGACCAGCGA TCTGCGGGAA GCCAACGATG AAATCCAGCG TTTTGCCTAT
ATCGTCAGCC ACGACCTGCG CTCGCCGCTG GTCAACATCA TGGGTTTCAC CAGCGAACTC
GAGGAACTGC GGGGCGACAT ATTCCGCCGG ATCGCGGAGC TGAGCCGGCC GGATAGCCTG
CCGCCCCCTG GCGTGAACGA TCCAGCCAGT GCGGCCGAAC CGGTTCTCGA TGGACCGGAC
GAGCAATTGT CGCAGGACTT CACGGAGGCT CTGGGATTCA TCAAGTCGTC GATCGCCAAG
ATGGATCGTC TCATCGGCGC GATCCTCAGC CTCACCCGCG AGGGCCGGCG CGAATTTCAT
CCCGTTTCGA TCGCCATGCG CGATCTGATC GAGGGCATTG TATCCACCGT GGCGCATCAG
GCCGCCGAAG CGGACGCCCA GATCCGGGTC GAGCCGTTGC CGGATATTGT TACCGACCGC
ATCGCGATCG AACAGATCTT CTCCAATTTG ATCGACAATG CCTTGAAATA TCTCAAACCT
GACGTTCCCG GCGATATCGC CATCAGTGGC CGGCGCAAGC TGGGTTTTGC TATTTTTGAG
ATCACGGACA ACGGTCGCGG CATCGATCCG AAGGATCACC AAAGGATTTT CGATCTGTTC
CGCCGGGCGG GGCCGCAGGA TCGGCCGGGC CAGGGTATCG GACTGGCGCA CGTCCGTGCA
CTGGTCCGAA GGCTCGGAGG GACCATATCA GTCTCGTCGA AGCTTGGTGC CGGCACCACG
TTTACAGTTA CGCTGCCCAG CACATGGATG GTCACAAACC GGGACGAACA ATCATGA
 
Protein sequence
MRAEGFSKRP VNARGRFRLS RSEYDEIVAP RARTSRPALS QLLLLTAGLV VLTLISVISI 
YLVNKARDDT RWVLRSMEVE NEISLAQLQI QRAESSQRSF LLTQRPEFRA AFEDASGRIP
LSFERLRVLT KENPYQQQRL QQMVPLTSQR IEQLRQSIDL ALTNRLDQAV ESARREAGQD
ATRQIRDLGN EMRTEANRLL AARTASADAS QSRSALITSV GSGLVVLFAG LSIYLVRRSN
RERDLANARL RDINLNLEST IEKRTSDLRE ANDEIQRFAY IVSHDLRSPL VNIMGFTSEL
EELRGDIFRR IAELSRPDSL PPPGVNDPAS AAEPVLDGPD EQLSQDFTEA LGFIKSSIAK
MDRLIGAILS LTREGRREFH PVSIAMRDLI EGIVSTVAHQ AAEADAQIRV EPLPDIVTDR
IAIEQIFSNL IDNALKYLKP DVPGDIAISG RRKLGFAIFE ITDNGRGIDP KDHQRIFDLF
RRAGPQDRPG QGIGLAHVRA LVRRLGGTIS VSSKLGAGTT FTVTLPSTWM VTNRDEQS