Gene Nwi_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0474 
Symbol 
ID3676506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp531419 
End bp533086 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content66% 
IMG OID637712016 
ProductHemY protein 
Protein accessionYP_317093 
Protein GI75674672 
COG category[S] Function unknown 
COG ID[COG3898] Uncharacterized membrane-bound protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCGGA TCATCCTATT TCTCGTTCTT ATCGCTCTCT CTGCGCTGGG CGCCACCTGG 
ATCGCCGATC AGACCGGCAC GATCGTTCTG TCATGGGACG TCTGGCGGAT CGAGACCACG
ATCCCGGTGT TCGCGCTCGG GCTCGGCCTG CTGATCGTTG CCTCGCTCCT GGCCTGGAGC
GTGGTGCACG GATTGTGGCA GGCGCCGACA CGTATGCGCC GCGCCCGCCG GGAGCGTCGG
ATCGCGCGCG GCAGGGACGC CATCACGCGG GGTCTGCTCG CGATCGGTTA CGGCGATGCC
GCGGCCGCCC GCAATCACGC CAAGGCGGCC CGGCGTCTGG TGTCGAACGA CCCGCTGGCG
CTGCTCCTGC ATGCCCAGGC CGCGCAGCTT GACGGGGACG CGGACCGGGC GCAGCGTGCT
TTCCGGACCA TGGCCGAACG CAACGATACG CGGCTATTGG GCTTGCGAGG GCTTTTCATC
GAAGCCCAGC GTGCCGGCGA TCCGGCGGCG GCGGTGGCGA TCGCGGAGGA AGCGCTCAAG
ACCTCGCCAT CCTCGATATG GGCTTCGCAA GCCGCGCTCG GCTTCCGTTG CGCCAGAGGC
GACTGGACCG GCGCGCTCGC CATCCTCGAC AAGAACGTGG CGTCAGGTCG TATCGACAAG
GCGCTGTATC GCCGTCAGCG CGGGGTGCTG CTGACGGCGA GGGCGTTGGA GCTTGAAAAG
TCCGATCGCG ACCTGTCGCG CCAGACCGTA ATGGAAGCGG TGAAGTTTGC GCCGACGCTG
ATTCCGGCCG TGGTGCTTGC GAGCAAGTAT CTGAGCGAGG CGAACCAGAT GCGGCGCGCG
ATGCGTCTCA TCGAAACCGC CTGGCTGGCC CAGCCGCATC CTGATCTCGC CGATGCCTAT
GCGCATGTGA TGCCGGGCGA TTCAGCGCTG CAGCGATTGG CGCGCGTCGA GAAACTATCG
GAGAAGGCGC CCGGTCATCC GGAAAGCGCG ATGGCGATTG CGCGCGCTGC GATCGACGCC
AGCGAATTCG CGCGGGCCCG CGAAGTTCTG GAGCCGCTGA TCGCAGCGCC GACGCAACGC
GTTGCGATGC TGATGGCCGA AATAGAGCAC AACGAGCGCG GCGATAGCGG GCGCGCCCGC
GCTTGGACGT TGCGCGCCGT GCGTGCCCTG CACGATCCGG TATGGACCGC GGATGGGTAC
GTGTCCGACC ATTGGCGTCC GGTTTCGCCG TTAACCGGGC GGGTTGACGC GTTTCAGTGG
CAAACTCCGT TGTCGGCCTT GCCTTCGAGC AGGCCTCCCT TGGTAGAGGC GGAGGTTTCG
GACCAGATCG CGGAAGATGC GCCGCGCGTC GAAGCTTTGC CTGCCGCGAC CTCCACCGAG
GATTTCAAGG CTCAAGACGG CGCGATACCC CCGGCCGGGG TCGAAGCCTC ATCGTCTGGA
ACGATTTCCA TGCCGGACGA GACCGCGACG GCATCGGCCG ATTACACGCC ATCAGAGGAA
GACAACAAGC CCGGAGCGAT TCCCGACGTG ATCCCGATCG TTCGTGCTCC GGACGATCCG
GGCGTCGATG ATGACAATCC CCGCAATGAC GGGTTTGTCG AGGACGATAC GGCTGCGGCG
CGCCAGGCGG GAGGATTGCG CGGGCTCCTG TCGCGCAGGG GGAACTGA
 
Protein sequence
MLRIILFLVL IALSALGATW IADQTGTIVL SWDVWRIETT IPVFALGLGL LIVASLLAWS 
VVHGLWQAPT RMRRARRERR IARGRDAITR GLLAIGYGDA AAARNHAKAA RRLVSNDPLA
LLLHAQAAQL DGDADRAQRA FRTMAERNDT RLLGLRGLFI EAQRAGDPAA AVAIAEEALK
TSPSSIWASQ AALGFRCARG DWTGALAILD KNVASGRIDK ALYRRQRGVL LTARALELEK
SDRDLSRQTV MEAVKFAPTL IPAVVLASKY LSEANQMRRA MRLIETAWLA QPHPDLADAY
AHVMPGDSAL QRLARVEKLS EKAPGHPESA MAIARAAIDA SEFARAREVL EPLIAAPTQR
VAMLMAEIEH NERGDSGRAR AWTLRAVRAL HDPVWTADGY VSDHWRPVSP LTGRVDAFQW
QTPLSALPSS RPPLVEAEVS DQIAEDAPRV EALPAATSTE DFKAQDGAIP PAGVEASSSG
TISMPDETAT ASADYTPSEE DNKPGAIPDV IPIVRAPDDP GVDDDNPRND GFVEDDTAAA
RQAGGLRGLL SRRGN