Gene Nwi_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2072 
Symbol 
ID3675487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2269061 
End bp2271421 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content57% 
IMG OID637713638 
ProductTonB-dependent receptor 
Protein accessionYP_318682 
Protein GI75676261 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.516649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGAGAA TATTGTTCAT CGGCGCGCTG CTTGGCGGCG AGTCGTACGG CGCGTTGGCC 
GTAGCGCAGG ATTCAACGAA TCTTCCGCCC GTGACCATTG TCAGCGATGG TCAGCCGAGA
AATGTGCGGC AGAACAGCAC GACGAATGCA CGGGGCCGGA CCGCGCGACG TGCGCGACAG
GCTGTGCCGA GCAATCAGCA ACCTGTCCCG CAGCAACAGC AACAATCTGG CGGTGTTCGG
GCATCGCAAA ACTCCGTCCT CAGCACCACG CCGCAGTTAG CCGGCGCCAG CAGCGTGACA
CAGCAGGGCA TCGCGATTCT GGGCGGACCG GCGCAGACGA GCTTCTACCA GCCGCTGGCC
CTGATTCCCT CTGTTTCGGT TCAGACACCG GATCCGTATG GTTTGAATAC GACGCGAAAC
ATTAATATTC GCGGCAAGGG AGATTTTCAT CTTTCACGCA CGATCGACGG CTTACCGCTA
ATGGGAATTG TGGGCGGCTC CGATCTGTTC GATCTGGAGA ACATTGGGCG TATCGACGTT
TATCGCGGTG CCGTGCCGTC CGACAAAGGC ATTGGCCTGT CGAACGCGAC CGGCGTGATC
AACCAGCTCA CGCTGAGACC ACAGGATAAA GCTGGCTTCA CGGCGCGCCA AGCCTTCGGC
ACCGACAGCT TCTATAAGAC ATTCGTTCGA ATCGACAGCG GCCTGAATCC GGAGACGGCA
ACCAAAGCCT TTCTGTCCGG GTCCAATATC GGTGTCGACA AGTGGACGGG CGCTGGCGAC
CAGAAACGCC AGAACGTCAC GTTCGGGTTG AGCCAGGATT TCGGTGACCG TATCACCCTC
GACGTTACCG CGGTCTACAA CAACTACGCC GAGAATTTCT TTCGAGCGCT AACGTATCCT
CAGACCACAA ATCTAAGAAA CAACTATAGT CATGACTTCA ACACCACGCT GACGGGCGTC
GCGGCAACCG ACGTCAACTA CTACAAGTTT AACCGGATGA ACGCTAAAAC GTTCGCGACT
TTCGCAAAGC TTGATTATAA CTTTGCCGAA GGACAGCATC TGCTGTTCAA GCCATACTAC
TGGGACAACG ACACCACGCG ATACAACGCG GCCGGAAGTA ACGTTCAGAT CTGGCGTCAG
CAGAACCAGA ACGTCGGTAG CGTTTTCGAG TACGCCGGGC AGTTTCCCTG GGGAACCGAC
GTCGTTGTCG GATACTGGTG GCAATCGATG AAGCCGCCGC CACCTCCGAC GGATCAGCGG
CGTTTCACCG TGGATGCGGC CGGCGGACTT GATTTCTCCC ACTGGCAGAG TCTTGCGAGA
ATCGACAACT TCAGCGTCAA CAGTCCGTAC TTTCAGGTTT CGCAGAATTT TGGGTCAACC
TTTGTGACCA GCGGTCTGCG CTACATGGTT CTCGGGGCAC CGCAGATGTT GTACTACAAC
ACTGCGGGAA TTTCTGACGG CACTCACGGT CAGGCGCTGG CATTGAACCC CGCGATCTAT
CCCGATGCCA CCCTGGCTGC ACGGGACTAT ACCGCATGGC TTCCCAACGT CGCGATTCGT
CACGATCTCA ACCCGGCTCT CAGCGTGAAT TTCAGCTATG GACGCAGATT CGGCCGCCCG
GATTGGGGAC CGCAAGCAAG CAATTACATC AGCAACCGCA CGGTGTTCAC CGCGAGGGGG
TTCTCGCTGC AATCGCTGGT CGACAAGGTG CGGCCGGAAA TCTCTGATCA GTTTGATGCT
TCGCTACGCT TTAGCCAGTA CGGGCTGACT GTGATCCCAA CCCTGTTCTA CGCCAAATAC
CAGAACAAGC AGGTCAAGGT CATCGATCCA TCGATCGGCC CGAACATCGC ATACTTTCAA
GGCACCGGAT CGAGCACCGG ATACGGCTTT GAGCTTGAAG CGAACTATAG GTTCGACGAG
CAGTTCTCAG TCTTCGGTTC GACGACGCTG GCATCCGAAA CATTCGATTC CGATACGCCA
ACCCTGAGCG GCGGCGCCAT GCTGGCGACC AAGGGTAAGC AGATCCCGAA CACGCCGCAA
GTCATGATAA AGGGCGGAGT AACCTACCAG GTGGACCGTC TGGCGATCAT GCCGATCGTA
CGTTACATCG GCCCGCGTTT CGGTGAAGCC GCCAACACCC AGCGTGTATC CGGATACACC
GTTGCGGATT TGACGATGTC CTATGATCTC GGGTCCCATT TTGGTGTCGA ATCCCTGAAT
GCGAGCTTCT CGATCCAGAA CATCTTTGAT CGTCAGTACA TCTCACAGAT CTCTCCCAGC
GACATCGATC TGAGCGCGGG CGCGACCTAT TTCCTGGGAG CGCCGCGAAC GGTTGTCGGA
TCACTGTCGA TGAAATTCTG A
 
Protein sequence
MKRILFIGAL LGGESYGALA VAQDSTNLPP VTIVSDGQPR NVRQNSTTNA RGRTARRARQ 
AVPSNQQPVP QQQQQSGGVR ASQNSVLSTT PQLAGASSVT QQGIAILGGP AQTSFYQPLA
LIPSVSVQTP DPYGLNTTRN INIRGKGDFH LSRTIDGLPL MGIVGGSDLF DLENIGRIDV
YRGAVPSDKG IGLSNATGVI NQLTLRPQDK AGFTARQAFG TDSFYKTFVR IDSGLNPETA
TKAFLSGSNI GVDKWTGAGD QKRQNVTFGL SQDFGDRITL DVTAVYNNYA ENFFRALTYP
QTTNLRNNYS HDFNTTLTGV AATDVNYYKF NRMNAKTFAT FAKLDYNFAE GQHLLFKPYY
WDNDTTRYNA AGSNVQIWRQ QNQNVGSVFE YAGQFPWGTD VVVGYWWQSM KPPPPPTDQR
RFTVDAAGGL DFSHWQSLAR IDNFSVNSPY FQVSQNFGST FVTSGLRYMV LGAPQMLYYN
TAGISDGTHG QALALNPAIY PDATLAARDY TAWLPNVAIR HDLNPALSVN FSYGRRFGRP
DWGPQASNYI SNRTVFTARG FSLQSLVDKV RPEISDQFDA SLRFSQYGLT VIPTLFYAKY
QNKQVKVIDP SIGPNIAYFQ GTGSSTGYGF ELEANYRFDE QFSVFGSTTL ASETFDSDTP
TLSGGAMLAT KGKQIPNTPQ VMIKGGVTYQ VDRLAIMPIV RYIGPRFGEA ANTQRVSGYT
VADLTMSYDL GSHFGVESLN ASFSIQNIFD RQYISQISPS DIDLSAGATY FLGAPRTVVG
SLSMKF