Gene Nwi_2219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2219 
Symbol 
ID3676417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2412121 
End bp2414199 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content62% 
IMG OID637713782 
ProductTonB-dependent receptor 
Protein accessionYP_318825 
Protein GI75676404 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCCG CCTTCGTGTC CGGAGCTTAT CGCAGAAGCC TGCTTGCTTC CGCCCTCCTC 
GCCTCGCTGG GATCGATCGG CCTGAATACG CCCACAAGCG CGCAGCAATC CGCCTCTCCC
AACCTGCTGC CTCCGATCCG GATCGCGCCA CCCCCGGATC GAGCGATGGC GCCCGCACAC
CGCACGACTT CGCGATCACG CCGTGTCGTC CGCCCCGCTC CCCGGCAAAC TCCTGTCGCT
GAGACTCATG TTCCGGGCAC GATGGGTTCG ACCGTAGTCA GCCCCACGGG AGTGGTGACG
CCCGCGGGCC AGCTGGCCAG CTCCATCACC GTCGTCACCG AGCAGGATAT CCAGACCCAG
CAACACCGCA GCGTTCCGGA TATCCTCAGG ACGGCTCCCG GCCTGAATGT GGTGCAAGCC
GGCGGACCTG GCGCCCAGAC CTCGATCTTC ATGCGAGGCA CGAACGCCAA CCACACCAAG
GTGATGCTCG ACGGAATCGA CATCGGCGAT CCCGCCAACT CGAATGGCGC GTTCGACTAT
GCGCACCTTC TCACGGCGGA CATCCAGCAG ATGGAAATCC TCCGGGGCCC GCAGAGCGGA
CTCTACGGCT CCGACGCGAT CGGCGGCGTC ATCTCCATCA TCACGAAAAA GGGCGAAGGT
CCGGCGCGGG TCACCGGCTC ACTCGAGTCC GGCTCGTTCA AAACCTTCAA CCAGACGCTC
GGACTGAGCG GCGCCGAACG GAACTTCAAC TACGCGGTCA ACGTCGCCCA TCTCCATGCC
GGAGACGTTC CGGTCACGCC GCCCGAGCTT CTGCCGCCGG GGCGGCAGGC GATCGGCAAC
AACTATGACA ACATGACTTA CTCGACCAAG CTCGGAGTCG ATCTCAATGA GTATCTGACC
GTCAATTCGG TGGTCCGCTA TACCGACTCG ACGCTGCTGT TCACGGGCGA CGGCGGATTT
CCAAGCACTC CCAACGCCAG CCAGAGCACG CACGCCGTCC AGCAACTCTT CAACCGTCAG
GAGGCCGTGT GGTCGCTGTT CGACGGACGC GTCCAGAGCT TCTTCGGTCT CAATTTCACG
AACAGCAGAG CCTATGACGT GGGTCCGGGC GACCCGGCCG CGACGATCAC CACGGGAGAA
CGCCTCAAAT TCGACTGGCG CACCGTGACC GAGATTACGC GAGACAACCA TCTGATCGTC
GGCGCCGAGC ATCAGACCGA TCGCATGGAC ACCGCCGATT TTGCCGCCAG GAACGGCAAC
AAGGCAGGTT TCGTGCAGTT GCAATCGGCT TTCGCGGACC GCTTTTTCCT GGTGGCCAAT
GTCCGCCAGG ATTCCAACGA CCTGTTCGGC GGCCGCATGA CTTACCGGAT TGCGCCGGCG
GTCATTGTTC CCGTTACCGA AACCAAGCTG AAAGCAAGCT ACGGCACGGG ATTCAAGGCG
CCCTCGCTGA GCCAGTTGTT CCGGGATTAT CCGACCTTCA ATTTCTTCGC CAATCCCAAC
CTGCAACCGG AAGACAGCCG GGGCTTCGAC GCCGGTTTCG AGCAACCTTT GTTCAACGAC
CGCGTGCGTT TCGGCTCGAC CTACTTCCGA AACGACATCA CCAACCTGAT CGACTACAAT
TCCACGTTCA CCTCACTGGT GAACGTGAAC AGCGCGACGA CCGAGGGCAC CGAGACTTTC
GTCGCGGCGC AGATCACCGA ACGTTTCGGA ATCCGCGCGG ATTACACCTT CATCCGCGCC
GTCAATGCCG CCACCGGCAT GCAACTGCTG CGCCGGCCCA AGGAGAAATG GAGCGCAACC
GCGACCTGGC TTCCGCTTGA TGCGTTGACG TTGTCGGCAA CTTTGGTCCG GGTCAACGAC
TGGCTCGATG TCACCCGCGA CGGGATGGCA TCCGGAATCA CGGTGCCCGG CTATACGCTG
GTGAACTTGC GCGGCGACTA CGCACTCAGC GACCAGGTCA AGGTATTCGG ACGGATCGAT
AATCTCCTGG ACTTCCGCTA CCAGAATCCG ACCGGATTCC TCGCGCCGGG CCTCGCGGTT
TTCGGCGGCA TCCGGGTGGC CAGTTACGGA GTGCGATAG
 
Protein sequence
MSAAFVSGAY RRSLLASALL ASLGSIGLNT PTSAQQSASP NLLPPIRIAP PPDRAMAPAH 
RTTSRSRRVV RPAPRQTPVA ETHVPGTMGS TVVSPTGVVT PAGQLASSIT VVTEQDIQTQ
QHRSVPDILR TAPGLNVVQA GGPGAQTSIF MRGTNANHTK VMLDGIDIGD PANSNGAFDY
AHLLTADIQQ MEILRGPQSG LYGSDAIGGV ISIITKKGEG PARVTGSLES GSFKTFNQTL
GLSGAERNFN YAVNVAHLHA GDVPVTPPEL LPPGRQAIGN NYDNMTYSTK LGVDLNEYLT
VNSVVRYTDS TLLFTGDGGF PSTPNASQST HAVQQLFNRQ EAVWSLFDGR VQSFFGLNFT
NSRAYDVGPG DPAATITTGE RLKFDWRTVT EITRDNHLIV GAEHQTDRMD TADFAARNGN
KAGFVQLQSA FADRFFLVAN VRQDSNDLFG GRMTYRIAPA VIVPVTETKL KASYGTGFKA
PSLSQLFRDY PTFNFFANPN LQPEDSRGFD AGFEQPLFND RVRFGSTYFR NDITNLIDYN
STFTSLVNVN SATTEGTETF VAAQITERFG IRADYTFIRA VNAATGMQLL RRPKEKWSAT
ATWLPLDALT LSATLVRVND WLDVTRDGMA SGITVPGYTL VNLRGDYALS DQVKVFGRID
NLLDFRYQNP TGFLAPGLAV FGGIRVASYG VR