Gene Nwi_0289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0289 
Symbol 
ID3677117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp327254 
End bp329152 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content64% 
IMG OID637711829 
Productextracellular solute-binding protein 
Protein accessionYP_316908 
Protein GI75674487 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGGC CCAGCCGTCG GCAGGTGTTC GGTCTTGGGA TCGGTGCGGC AGGCGCGGCA 
TGGCTGCGGC CGGCCGTCGC CGTGGCCGCG AACGGCGCGG AGCCGAACCC GCATGGCGAA
TCCCAAGGCC AATCTCACGG CATGTCGGCC TTCGGGGACC TGAAATATCC GGCTGATTTT
CACCACTTCG ATTACGTCAA TCCGGACGCG CCGAAGGGCG GGCTGTTCGC GACCATTCCG
TCGAGCCGCG CTTTCAATCA ATCGTTCCAG ACCTTCAACT CGCTCAACGC CTTTATCCTG
AAGGGCGACG GCGCGCAGGG CATGGGGATG ACGTTCACGT CGCTGATGGC GCGCGCCGGT
GACGAGCCCG ATGCGATGTA TGGCCTGGCG GCGAAATCGG TTCGCATCTC GGCCGATGGC
CTGACCTACC GCTTCACGAT GCGGCCGGAG GCGCGGTTTC ATGACGGACA AAAAATTACC
GCGCGCGATG CCGCCTTTTC CCTGATGGTC CTGAAGACCA AGGGTCATCC CCTGATCACG
CAGCAGGTGC GCGACATGGT GAAGGCGGAA GCGCCCGACG ACGCGACGCT CGTGGTGACC
TTCGCCGCGA AACGCGGTCG GGATGTGCCT TTGTTCGTCG CCGGCCTGCC GATCTTTTCG
CAAGCCTATT ACACGGCGCG CCCGTTCGAT GAAACGACGC TCGACGTTCC GCTCGGCAGC
GGGCCGTACA AGGTCGGCCG GTTCGAGGCA AATCGTTTCA TCGAATTCGA TCGCGTGAAG
GATTGGTGGG GCGCGGACCT TCCGGTGTGC CGCGGCGCTT ACAACTTCGA TACGGTGCGA
TTCGATTTCT ACCGCGACCG TGATGTGGCG TTCGAGGGCT TCACCGGCCG CAGTTATCTG
TACCGCGAGG AGTTCACCTC CCGCATCTGG AATACGCGCT ATGATTTCCC GGCGATGACC
GACGGCCGCG TCAAGCGCGA GCAATTGCCG GACGAGACGC CGTCCGGCGC GCAGGGCTGG
TTCATCAACA CCCGCCGCGA CAAGTTCAAG GATCCTCGCG TCCGCGAAGC GCTCGACTGC
GCCTTCGATT TCGAGTGGAC CAACAAGTCC ATCATGTACG GCGCCTATGT GCGGACGGTA
TCGCCTTTCC AGAATTCCGA TCTGATGGCG AGCGGTCCGC CGTCGCAGGA GGAGGTGGCG
TTGCTGGAGC CCTTCCGCGG CAAGGTGCCG GATGAAGTGT TCGGCAATCC CTACACTCCG
CTCGTCTCGG ACGGATCGGG ACAGGACCGC AAGCAGTTGC GCAGGGCCGC GCAACTGCTC
GACGAGGCAG GCTTTCATAT CAAGGACAGG AAGCGGATGA CCCCGCGGGG CGAGGTCTTC
CGCCTCGAAT TCCTGCTCGA TGAGCCGGCC TTCCAGGCTC ACCACATGCC CTATATCAAG
AATCTCCAGA CCCTCGGCAT CGAGGCGACG CTGCGGCTCG TGGACCCGGT TCAGTCGCGC
TCGCGGCGCG ACGACTTCGA TTTCGACATC ATCATCGAAC GTTTCAGTTT CTCGACCATT
CCAGGCGATT CGCTGCGGCC GTTCTTTTCG TCGCGCGCGG CGGCGACCAA GGGCTCGAAC
AACCTGGCGG GCATCGCCGA TCCCGCGATC GATGCGCTGA TGGAGCAGGT CATCGTCGCC
GACACCCGCG CCAGGCTCGT CTTCGCGGCG CGCGCGCTGG ATCGCGTGAT TCGCGCCGGC
CGCTATTGGG TGCCGCAATG GTATTCGAAC ACGCACCGGC TGGCCTATTG GGATGTGTTC
GCCCATCCGC CGAGCCTGCC GAAATACCTC GGCGTCATGG CGCCTGATAT CTGGTGGTCG
ACACAGGCCC GGCCGGCATC ATCCGGGCAG GCGGGATAA
 
Protein sequence
MSRPSRRQVF GLGIGAAGAA WLRPAVAVAA NGAEPNPHGE SQGQSHGMSA FGDLKYPADF 
HHFDYVNPDA PKGGLFATIP SSRAFNQSFQ TFNSLNAFIL KGDGAQGMGM TFTSLMARAG
DEPDAMYGLA AKSVRISADG LTYRFTMRPE ARFHDGQKIT ARDAAFSLMV LKTKGHPLIT
QQVRDMVKAE APDDATLVVT FAAKRGRDVP LFVAGLPIFS QAYYTARPFD ETTLDVPLGS
GPYKVGRFEA NRFIEFDRVK DWWGADLPVC RGAYNFDTVR FDFYRDRDVA FEGFTGRSYL
YREEFTSRIW NTRYDFPAMT DGRVKREQLP DETPSGAQGW FINTRRDKFK DPRVREALDC
AFDFEWTNKS IMYGAYVRTV SPFQNSDLMA SGPPSQEEVA LLEPFRGKVP DEVFGNPYTP
LVSDGSGQDR KQLRRAAQLL DEAGFHIKDR KRMTPRGEVF RLEFLLDEPA FQAHHMPYIK
NLQTLGIEAT LRLVDPVQSR SRRDDFDFDI IIERFSFSTI PGDSLRPFFS SRAAATKGSN
NLAGIADPAI DALMEQVIVA DTRARLVFAA RALDRVIRAG RYWVPQWYSN THRLAYWDVF
AHPPSLPKYL GVMAPDIWWS TQARPASSGQ AG