Gene Nwi_1806 details


Gene Information

Locus tag: Nwi_1806
Symbol:
ID: 3677061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 1978658
End bp: 1980070
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 66%
IMG OID: 637713368
Product: proline-rich protein
Protein accession: YP_318419
Protein GI: 75675998
COG category:
COG ID:
TIGRFAM ID:
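
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAA). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1978658, 1980070)
print(length)                     # 1413
print(protein_length_aa(length))  # 470
```

Both results match the Gene Length and Protein Length fields in the record.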


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.321143
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.323939
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCATC GATATCAGGA CCGGCCCTTC CCGGAGGACG ACGATTACGG GCGCGACGAC 
CAGCGCGCGA CTGCTCCCAC GGAGCCGGAT CCGCTTGCGG AACTGGCGCG CCTGATTGGT
CAGGGCGATT CTTTCGGTGA GCCCAGGCGG GAAACGCCGA TGGCTCCCGT GGATGATCCC
GTCGACCATT TCGAGCCGCT GCCGCCGATC GAGGAACCGA TCGAGGGGGA GCCACCCGCC
GGACCGCCGT CCTGGATTCA AAGGCGGATG GCCCGGCCCG ATCCTTCGCC GGAGCAACTC
GAATCCTCGG CGCATCCGGT GCTGCGGCGC GCGGCGGTTT ATCCAGACAG CCATCAGCCT
GAGCCGGAAT CTTATCAGGC CGATCCCGGC CGTTACGACG ATGCCCGCTA CGGGAATTTG
CCGGACGATC AGCCGGCCTA TGCGTATGGC GATCGCGTTC CGCACGATGC TTATGAAGGC
CAGTATCCGC GCGAGACCTA TCCGGAAGCG CCCTTCGGCG ACGAGAACGG CTATGGCGAA
TCTTACGGCT ACGGGGAGGC CGAAGATGAA CCGGCTCCCG GCCGTCGCGG AAAAATGGCG
ACGGTGGTCG CGGTTCTGGC GCTCGCGGTG GTCGGCACCG GCGCGGCGTA TGCTTATCGC
ACCTTTGTCG GGAACGCGCC TAACGGTGAG CCGCCGATCA TCAGGGCCGA TACCGAACCC
AACAAGATCG TGCCTCAGCA GACGACTTCG GGTGATGCCG TCGGCAAGTT GCTTCAGGAC
CGGATGTCCG TGGAAAACGG CACCGAGCAG GTGGTGTCTC GCGAAGAGCA GCCGGTCGAT
GTCAAAAGCG CCGTGAGCGC CGGCCCGCGT GTGGTTTTTC CGCCACTGAA CCAGAACGAT
AACCCGCCGT CCGCGGCTAG CGTGGCTCCG GATATCAAGC CTCCCGCGAC CGTTGCAAAC
AGCGCGCTCG GGGGCGACGA GCCGCGCAAG GTCAGGACCC TCGCGGTCCG CGGTGATCAG
GCCGATCTCG CCGCCGCGTC CGCCGCCAGC CGGCATGCCG CTGCGGCGGG GGCCGCGCCC
GGAGCCAATG TGCCGCTCTC GCAGACCGCC GACACGCGAA CCCGGATGGC GTCGACCCAT
CCGGCGCAGC AGTCGTCCGC CAGCGGGGGC TATCTGGTCC AGGTTTCCTC GCAAAGGAAT
GAGGCGGACG CGCAGGCGTC GTTCCGGGTG CTACAGGGTA AATTCCCGTC GATGCTGGGA
TCGCGTACGC CTGTGATCAA GCGCGCCGAC CTCGGCGCCA AGGGTGTCTA CTATCGCGCA
ATGGTCGGGC CGTTCGCCTC GTCCGGCGAG GCATCGCGTT TCTGCGGAAG CCTCAAATCG
GCCGGAGGAC AGTGCGTCGT CCAAAGGAAT TAA
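
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be cleaned before its length or GC content can be compared with the record. The following is a minimal sketch with hypothetical helper names. It assumes the sequence contains only the unambiguous bases A, C, G and T. Only the first row of the sequence is used here as sample input, so its GC fraction is not expected to equal the value reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes only A, C, G, T)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence as listed in the record.
first_row = (
    "ATGGCTCATC GATATCAGGA CCGGCCCTTC "
    "CCGGAGGACG ACGATTACGG GCGCGACGAC"
)
seq = clean_sequence(first_row)
print(len(seq))
print(f"{gc_fraction(seq):.1%}")
```

Passing the full sequence text through the same two functions gives values that can be compared with the Gene Length and GC content fields.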
 
Protein sequence
MAHRYQDRPF PEDDDYGRDD QRATAPTEPD PLAELARLIG QGDSFGEPRR ETPMAPVDDP 
VDHFEPLPPI EEPIEGEPPA GPPSWIQRRM ARPDPSPEQL ESSAHPVLRR AAVYPDSHQP
EPESYQADPG RYDDARYGNL PDDQPAYAYG DRVPHDAYEG QYPRETYPEA PFGDENGYGE
SYGYGEAEDE PAPGRRGKMA TVVAVLALAV VGTGAAYAYR TFVGNAPNGE PPIIRADTEP
NKIVPQQTTS GDAVGKLLQD RMSVENGTEQ VVSREEQPVD VKSAVSAGPR VVFPPLNQND
NPPSAASVAP DIKPPATVAN SALGGDEPRK VRTLAVRGDQ ADLAAASAAS RHAAAAGAAP
GANVPLSQTA DTRTRMASTH PAQQSSASGG YLVQVSSQRN EADAQASFRV LQGKFPSMLG
SRTPVIKRAD LGAKGVYYRA MVGPFASSGE ASRFCGSLKS AGGQCVVQRN