Gene Nwi_1502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1502 
Symbol 
ID3675905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp1630717 
End bp1631736 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content64% 
IMG OID637713055 
Productallophanate hydrolase subunit 2 
Protein accessionYP_318115 
Protein GI75675694 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0900497 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAC TTGTCGTCAC CGCTGTCGGG CCCGTCACCT CGGTTCAGGA CGGCGGCCGC 
TCCGGCGCGC AGCGTTACGG GTTGCCGCCA GGCGGCGCGG CGGATCGACT CGCGCTGGCT
GCCGGCAATT GTCTGGTCGG GAATCCCCCC TTTGCGGCTG CGATTGAAAT CGGCCCCTTC
AGTACGAGCT TCACCGTTCG TGAAGGCAGG GTACGCGTTG CGCTCACGGG TGCCGCTCGG
AGTGCGGATG TTTCGGGCCG CCCGGTTTCG TTCAACGAAT CCTGCACGTT CGGTGATGGC
GACAGCCTGA CACTCGGCGT GGCGCGCGAC GGCACCTTCA GCTACCTTGC GATCGAAGGC
GGCGTGAAGG GCGAACCGAT GTTCGGCAGC CTGGCCGTTC TTGCACGCGC TGGTCTCGGC
AGTCCGTTTC CCCGACCGTT ACAGGCTGGC GACGTGCTCG ATGTGGAGGC GGCGACCACC
GGGGCCGAAC ATGGGATCGA ACTTCCCGGC CAGGACGACG GCCCGATCCG CATCGTGATG
GGTCCGCAAG ACGATGAGTT CGGAGAGGCT ACAGGACTGT TTCTCGGCAG CGAGTGGAAA
ATCTCGGCGA CGAGCGATCG CATGGGCTAT CGCCTCGAAG GCCCGATCAT CGAACACCTT
CACGGCCATA ACATCGTCTC CGATGGCACC GTGAACGGCA GCATTCAGAT TCCCGGCAAC
GGACAACCGA TCGTCCTGAT GCCGGATCGC GGCACCAGCG GCGGCTATCC GAAGATCGCA
ACCGTCATGA CGGCGGATCT CGGTCGTTTC GCACAAATTC CCGTGGGCCG CGGTTTCCGC
TTCAAGTCGG TTACCGTCGC CGAGGCGCAG GCTGAAGCGC GCGCCATGGC CGACCTGTTG
CGAACGCTGC CCGGCCGGGT GCGCGCGGTG CGCAATACCG ACATTGACGA TGCCTTGCGC
AACGCCAATA TCGCCGGAAC CGCGGTGAAT GCATTCGATA GCGGGACGTG GCAGACATAG
 
Protein sequence
MSKLVVTAVG PVTSVQDGGR SGAQRYGLPP GGAADRLALA AGNCLVGNPP FAAAIEIGPF 
STSFTVREGR VRVALTGAAR SADVSGRPVS FNESCTFGDG DSLTLGVARD GTFSYLAIEG
GVKGEPMFGS LAVLARAGLG SPFPRPLQAG DVLDVEAATT GAEHGIELPG QDDGPIRIVM
GPQDDEFGEA TGLFLGSEWK ISATSDRMGY RLEGPIIEHL HGHNIVSDGT VNGSIQIPGN
GQPIVLMPDR GTSGGYPKIA TVMTADLGRF AQIPVGRGFR FKSVTVAEAQ AEARAMADLL
RTLPGRVRAV RNTDIDDALR NANIAGTAVN AFDSGTWQT