Gene Nwi_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2040 
Symbol 
ID3675819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2228973 
End bp2230238 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content71% 
IMG OID637713604 
ProductMotA/TolQ/ExbB proton channel 
Protein accessionYP_318651 
Protein GI75676230 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0811] Biopolymer transport proteins 
TIGRFAM ID[TIGR02797] tonB-system energizer ExbB 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.853694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGCCG GTTCCGGACC TCCAGAGCAC GCCAGCCGGT GGCGACGCTC GCAACGAAAG 
ATCATACCCA TGATGCGTAA CATCCGGACA ATCGCGGCGG GCGAACTCCC CGCAAGGATC
GCTCGCGGCC TGCTTGCCGC CGCGTTCATA CTCGCCATCC TGCCTGGTGA CGCCGACGCA
CGGGACCGGC GACCGCACCC GCAGCCTGAC GTTACGGCGT CACAGCCGGC CGCCACGCCG
CCGGCGTCCG CTGCGGACGC CGCACCATCG ACGCCTTCCG CGGCAGCGCC CGCGCCTTCC
GGGAGCGCGC CGGCTGGGGA TACGGCGGCG ACACCGGTTG CGCCGGTCGC TTCGCCTGGA
CCAGACACGC CCGCGCCATC ATCTGGCGAG ACGCCTTCGC CCGCGTCTGC CGATGGTGCA
TCGGGGGAGG GGTCTGCTTC ATCTGATGCG TCTGCGGGTG CATCGTCGGA CGCGGCCGCG
CCGTCGGATC AATCCGCACC TGCCGCTGCG CCGTCTGCGC CGGAGGGCTC GTCGTCGACG
CTTCCGCAGG ACCTGTCGCC GTGGGGCATG TTCATGCAGG CGGACATCGT CGTGAAAGCG
GTAATGATCG GTTTGGCGTT CGCATCATTG GTAACGTGGA CGGTCTGGCT GGCCAAGGGG
ATCGAGCTCG CGGCGTCGCG GCGGCGGGCG CAGAGCTGCG TGCGCAAGCT GGAGCGCGCC
GACAGCCTCG ATGCCGCGCG CTCCGAGATC GCCGAGGGCT GGACCTGTGA GGGCACAGTG
GCGGATCTGA TGGGGGCGGC GGAGCGCGAG CTGCAGCGCT CGGGCGATCT GTCGTCCGAG
GGCATCAAGG AGCGGCTGGC GATCGCGCTG TCGCGCATCG AGGCCAAGGC GGGGCGCGAC
ATCGCCCGCG GCACCGGCCT GCTGGCGACC ATCGGGGCGA CCGCGCCGTT CGTCGGCCTG
TTCGGCACGG TGTGGGGCAT CATGAACGCC TTCATCGGCA TCTCGCAGAC CAAGACCACC
AACCTCGCGG TGGTGGCGCC GGGCATTGCC GAGGCGCTGC TTGCGACCGC GCTCGGGCTT
GTCGCCGCGA TCCCGGCGGT GATCATCTAC AACGTGTTCG CTCGCGCGCT TTCGAGCTAC
CGGGCGCTGC TGTCGGACGC CTCCGGCGAG ATCATTCAGC ACATCTCGCG CGATCTGGAG
CGGCAGGAGC GGGGGCTCGG CCCGTCCGTG GTGGCGCTGT CGCGCGGCCG CGCTGCGGCT
GAGTGA
 
Protein sequence
MPAGSGPPEH ASRWRRSQRK IIPMMRNIRT IAAGELPARI ARGLLAAAFI LAILPGDADA 
RDRRPHPQPD VTASQPAATP PASAADAAPS TPSAAAPAPS GSAPAGDTAA TPVAPVASPG
PDTPAPSSGE TPSPASADGA SGEGSASSDA SAGASSDAAA PSDQSAPAAA PSAPEGSSST
LPQDLSPWGM FMQADIVVKA VMIGLAFASL VTWTVWLAKG IELAASRRRA QSCVRKLERA
DSLDAARSEI AEGWTCEGTV ADLMGAAERE LQRSGDLSSE GIKERLAIAL SRIEAKAGRD
IARGTGLLAT IGATAPFVGL FGTVWGIMNA FIGISQTKTT NLAVVAPGIA EALLATALGL
VAAIPAVIIY NVFARALSSY RALLSDASGE IIQHISRDLE RQERGLGPSV VALSRGRAAA
E