Gene Nwi_2202 details

Gene Information

Locus tag: Nwi_2202
Symbol:
ID: 3676870
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 2395555
End bp: 2396781
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 63%
IMG OID: 637713765
Product: VWA containing CoxE-like
Protein accession: YP_318808
Protein GI: 75676387
COG category: [R] General function prediction only
COG ID: [COG3552] Protein containing von Willebrand factor type A (vWA) domain
TIGRFAM ID:
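
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end coordinates are inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 2395555
end_bp = 2396781

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported 1227 bp and 408 aa.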


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.731855
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATCA ATCACTTTGA TTCGCCGACC GGCCACATCG CGGACAATGT CGCCGGCTTC 
GCCCGCACGC TGCGCGCTTG TGGTCTTCCG GTGGGTCCCG GCGCGGTCAT CGACGCGCTG
AAGGCTTTGC GTCTGATCGA CATCGGCAAT CGCGCGGACG TCTTCAGCAC GCTTCAGGCG
ATTTTTATGA CGCGCCACGA CCATGCGCCG ATATTCGCCC AGGCCTTTGA TCTTTTCTTC
TGCATCGCGG AGGAAGGGAA AAACATGCTG GATCCGGTCC AGCCGCTGGA CCAGGCCAGG
AAGCCGCCGC CGTCGGCGTC CCGGCGTGTT CTTGAGGCAT TGTCGCGACC TGCGATCACG
AGCGAGCGCG AGGCGCCGCA GGGACAGGAG ATGCGGCCTT CAGTCTCCGA TCTGGAGGTG
CTGCAAAAGA AGGATTTCGC GCAGATGAGC GCAGCCGAAC TGGCCCAAGT CACGCAGATC
ATCGCGAACA TGAAGCTGCC GCAAGCCAGA CTGCACACAC GACGTATCCG GCCCGATCCA
CGAGGGTCGC GGCTGGATTT GCGCCGCACG CTGCGTGGCG GGCTGCACAC CGGGGGCGAG
ATTGTCGATA TTCATCGCCT TGGGAGGATC GACAAGCCGG CTCCGATCGT AGCCCTGCTC
GATATCTCCG GCTCGATGAG CGAGTATACC CGCCTGTTCC TGCATTTCCT CCATGGCATC
ACAAACCGGC GCGGGCGCGT TTCGGTCTTT CTGTTCGGAA CGCGGTTGAC GAATGTCACG
CGCGCGTTGC GGGCGCGCGA TCCCGACGAG GCGCTGGCCG CCTGTTCCTC GGTGGTGGAG
GATTGGGCGG GGGGAACGCG GATCGCGACC TCGCTCCACG ATTTCAACAA GCTGTGGAGC
AGGCGCGTGC TCGGGCAAGG CGCCGTCGTC CTGATGATCA CTGACGGGCT GGAGCGGGAG
GCCGGCTCCG AACTGGCGTT CGAGATGGAT CGGCTGCACC GATCCTGCCG CCGATTGATC
TGGCTGAACC CGCTATTGCG CTATGATGGC TTTGAACCCA GGGCGCGAGG CATTAAGATG
ATGCTGCCGC ACGTTGACGA ATTTCGCCCG GTGCATAATC TGTCATCCAT CGAGGGACTC
ATTGCCGCGC TTTCCCAAGC ACCGGGACGA CATGGTCGCG GCGCCGCCCG CGGCTTGAAC
GAGGGGCATT ATGCTCGATC GCGATGA
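
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be stripped before any computation such as the GC content reported in the record. The following is a minimal sketch of that step; `clean_sequence` and `gc_content` are hypothetical helper names, not part of any database tool, and only the first sequence line is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGGCTATCA ATCACTTTGA TTCGCCGACC GGCCACATCG CGGACAATGT CGCCGGCTTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 3))
```

Passing the full 1227-base sequence to `gc_content` should give a value comparable to the GC content field, assuming that field is computed over the gene alone.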
 
Protein sequence
MAINHFDSPT GHIADNVAGF ARTLRACGLP VGPGAVIDAL KALRLIDIGN RADVFSTLQA 
IFMTRHDHAP IFAQAFDLFF CIAEEGKNML DPVQPLDQAR KPPPSASRRV LEALSRPAIT
SEREAPQGQE MRPSVSDLEV LQKKDFAQMS AAELAQVTQI IANMKLPQAR LHTRRIRPDP
RGSRLDLRRT LRGGLHTGGE IVDIHRLGRI DKPAPIVALL DISGSMSEYT RLFLHFLHGI
TNRRGRVSVF LFGTRLTNVT RALRARDPDE ALAACSSVVE DWAGGTRIAT SLHDFNKLWS
RRVLGQGAVV LMITDGLERE AGSELAFEMD RLHRSCRRLI WLNPLLRYDG FEPRARGIKM
MLPHVDEFRP VHNLSSIEGL IAALSQAPGR HGRGAARGLN EGHYARSR
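
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do this and is not the database's own method. It builds the codon table from the standard NCBI base ordering, whose amino-acid assignments are shared by translation table 11, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical. It does not handle the alternative start codons that table 11 allows, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G at each codon position, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence in frame 1 up to the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in the record.
print(translate("ATGGCTATCA ATCACTTTGA TTCGCCGACC"))
print(translate("GAGGGGCATT ATGCTCGATC GCGATGA"))
```

The two fragments translate to the first ten and last eight residues of the protein sequence listed above.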