Gene Nwi_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0073 
Symbol 
ID3675632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp87836 
End bp88846 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content65% 
IMG OID637711609 
Productthioredoxin-related 
Protein accessionYP_316693 
Protein GI75674272 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0246134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGCCA AGCATCGGCG AACGTCGGAT AACGGCACCA GACCGCTCAT TCCGGGGACA 
TCAGGCGAGG ATCACGTGAC CATAGTCGAG CACGACGGCG GGCCGCCACC ACAGGCGCCG
GATCTGATCA ACGAGACGAC GACGCAGACC TTCATGAAGG ATGTCGTCGA GGAATCGATG
CACCAGCCGG TGCTGGTCGA TTTCTGGGCG CCGCGGAGCG GACCGAGCCG CCAACTGAGC
CCGCTGCTGG AAAAGGCGGT GCGCGCCGCC GCCGGCAAGG TCAAGCTGGC GAAGATGAAT
ATCGATCAGC ATCCCGCCAT CTTTCAGCAG CTCGCGGCCC AGATCGGCAG CCATTCGATC
CCGGCGGTGT TCGCCTTCGT CGGCGGGCGG CCGGTCGATT ATTTCACAGG CGCGGTCCCC
GAAAGCCAGG TCAAGGACTT CATCGACAAG CTGACGCAAG GCGCGGGGGC GGCGCCGGGC
GCCCCTAACA TCGAAGAGAT CCTGCAAGAG GCTGACGCCG CGCTCGCTGC AGGCGATCCG
GCCACCGCGG CCGCGGTTTA TGCCGAGGCT CTCGGGATCG ACGCCGCCAA TCTTCGGGCG
ATCGCCGGGC TGGCGCGCTG CTATGCCAGC ACCGGCGCGA TCGACAAGGC CAAGCAAACG
CTCGCGCTGG TTCCGGAATC GAAGCGTGGC GACGCCGCCG TGACAACCGT TCAGGCCATG
ATCGACCTTG CCGAACAGGC GAGCTCGCTT GGACCGATCG CCGAGCTTGA GCAGAAGGTC
GCGGCCGACC CGCTCGATCA TCAGGCACGC TTCGACCTGG CTACGGCATT GAACGCCGGC
GGCAAACGCA GCGAGGCCAC CGATCACCTG CTTGAAATCG TGAAGCGCGA TCGCAAATGG
AACGATGATG CCGCCCGCAA GCAGCTTGTG CAGTTTTTCG AGGCATGGGG CGCCACAGAC
GAGGCCACCG TGGAGGGGCG CAAACGACTG TCGACGATTC TGTTTTCCTA A
 
Protein sequence
MLAKHRRTSD NGTRPLIPGT SGEDHVTIVE HDGGPPPQAP DLINETTTQT FMKDVVEESM 
HQPVLVDFWA PRSGPSRQLS PLLEKAVRAA AGKVKLAKMN IDQHPAIFQQ LAAQIGSHSI
PAVFAFVGGR PVDYFTGAVP ESQVKDFIDK LTQGAGAAPG APNIEEILQE ADAALAAGDP
ATAAAVYAEA LGIDAANLRA IAGLARCYAS TGAIDKAKQT LALVPESKRG DAAVTTVQAM
IDLAEQASSL GPIAELEQKV AADPLDHQAR FDLATALNAG GKRSEATDHL LEIVKRDRKW
NDDAARKQLV QFFEAWGATD EATVEGRKRL STILFS