Gene Nwi_2010 details

Gene Information

Locus tag: Nwi_2010
Symbol:
ID: 3674198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 2198062
End bp: 2199180
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 68%
IMG OID: 637713574
Product: SMF protein
Protein accession: YP_318621
Protein GI: 75676200
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
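
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names gene_length and protein_length are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2198062, 2199180)
    print(length, "bp")
    print(protein_length(length), "aa")
```

Under these assumptions, the computed values agree with the Gene Length and Protein Length fields of this record.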


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.272582
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACATCC GGGATCCAGA CAGACGGCTG AGCGAAGCGC AGCGGACCGA CTGGCTGCGG 
CTGATCCGGT CCGATAATGT CGGGCCGCGA ACCTTTCGTT CGCTCGTGAA CCACTTCGGC
AGCGCCAGCG AGGCGCTCGT CCGCCTACCC GATCTGGCGC GGCGTGGCGG CGCTTCCCGC
CCCAGCCGGC TTTGCACGGA AGCCGAGGCG CGGAACGAAC TGGCGGCCGC GCGGCGGATC
GGCGTCAGCC TGCTGGCGCC GGGAGAGGCC GGCTATCCCC CGCACCTTGC AACGATCGAC
GACGCCCCGC CGCTGCTTGG CGCGCGCGGC AACCTCGATG TCATGGAGCG TCCCATGATC
GCCATCGTCG GTTCACGCAA CGCATCGGGC GCCGGTCACA AGTTCGCGCA AACGCTGGCG
CATGATCTCG GTGATACCGG TTTCGTCATC GTATCGGGGC TGGCGCGCGG CATCGATCAG
GCGGCGCATC GCGCCAGCGT GGCGCGCGGC ACCGTCGCGG TGCTTGCCGG TGGCCACGAC
CGCATCTATC CATTGGAACA TGAGGATCTG CTGGCAGCCG TATTGGAAAG CGGCGGCGCG
ATTTCTGAGA TGCCGATGGG GCATGTCCCG CGGGCCCGCG ATTTCCCGCG GCGCAACCGC
CTGATCTCGG GCGCCGCGAT CGGCGTTGTC GTGGTCGAGG CGGCGCATCG TTCCGGCTCT
CTGATCACCG CCCGCATGGC CGCCGAGCAG GGCCGCGAGG TTTTCGCCGT GCCGGGCTCA
CCGCTCGATC CGCGCGCCAC CGGCACCAAT GATCTCATCA AGCAGGGCGC GACGCTGATC
ACCGAGGCAG CCGACGTCAT TAATGCTGTC CGGCCGATTA TAAGACGGCC GGTTGATCTG
CCCGCCGAAG AACCGGAGCC CGGCGAGCCC TGGACCGAAG AGCCCGCCGC AAGCGACCGT
GCGCGGATCA TCGCCTTGCT CGGTCCGGCG CCGATCGGAC TTGACGATCT GATTCGGATG
GCGAATGCCC CGCCGGCCGT CGTGCGCACG GTCCTGCTTG AACTGGAACT GGCCGGACGG
CTGGAGCGCC ATGGCGGCGG CCTGGTATCA ATGATGTGA
 
Protein sequence
MNIRDPDRRL SEAQRTDWLR LIRSDNVGPR TFRSLVNHFG SASEALVRLP DLARRGGASR 
PSRLCTEAEA RNELAAARRI GVSLLAPGEA GYPPHLATID DAPPLLGARG NLDVMERPMI
AIVGSRNASG AGHKFAQTLA HDLGDTGFVI VSGLARGIDQ AAHRASVARG TVAVLAGGHD
RIYPLEHEDL LAAVLESGGA ISEMPMGHVP RARDFPRRNR LISGAAIGVV VVEAAHRSGS
LITARMAAEQ GREVFAVPGS PLDPRATGTN DLIKQGATLI TEAADVINAV RPIIRRPVDL
PAEEPEPGEP WTEEPAASDR ARIIALLGPA PIGLDDLIRM ANAPPAVVRT VLLELELAGR
LERHGGGLVS MM
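
The gene sequence starts with GTG while the protein sequence starts with M, which is consistent with translation table 11, where some alternative initiation codons are read as methionine. The Python sketch below illustrates that relationship and a simple GC fraction calculation. It is not the database's own pipeline: the names translate, gc_fraction and clean are hypothetical, and START_CODONS is a simplified subset of the table 11 initiation codons chosen for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 uses the same
# codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a simplified subset of the initiation codons of table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("GTGAACATCC GGGATCCAGA CAGACGGCTG"))
```

Passing the full gene sequence to translate and gc_fraction allows a comparison with the listed protein sequence and with the GC content field of this record.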