Gene Nwi_1070 details

Gene Information

Locus tag: Nwi_1070
Symbol:
ID: 3674267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 1170260
End bp: 1171240
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 62%
IMG OID: 637712620
Product: ADP-L-glycero-D-manno-heptose-6-epimerase
Protein accession: YP_317684
Protein GI: 75675263
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase
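
The Start bp, End bp, Gene Length and Protein Length fields are consistent with one another, which a short Python sketch can check. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1170260, 1171240)
print(length_bp)                  # 981
print(protein_length(length_bp))  # 326
```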


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.43387
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.199498
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCTTG TAACCGGAGG CGCCGGTTTT ATCGGATCGA ATCTCGTGGC CGCGTTGAAT 
GACGCCGGGC GAGGCGATGT GGTTGTGTGC GATGCGCTGG GGCATGACGG CAAGTGGCGC
AACCTCGCCA AACGGCAGCT TGCGGATGTC GTTCCGCCCG CGGAACTGAC GTGTTGGCTC
GATGGCCGCC GCCTCGACGC CGTCTTTCAT CTCGGCGCGA TCTCCGAGAC GACCGCGACC
GATGGCGATC TCGTCATCGA GACCAATTTC CGGCTGTCGC TGCGATTGCT CGACTGGTGC
GCCGGGACCG CGACGCCTTT CATCTATGCC TCGTCGGCAT CGACTTACGG CGACGGCGCG
CAGGGCTTTC GCGACGATCA ATCGTTGGCC GCGTTGCGCG CGCTACGGCC GATGAATCTC
TACGGCTGGA GCAAGCACCT GTTCGACATG GCCGTCGTGG GCCGCGCCGC CCAAGGCGGT
GCTTTGCCGC CGCAATGGGC CGGCCTGAAG TTCTTCAACG TGTTCGGACC GAATGAGTAT
CACAAAGGCT CCATGATGAG CGTGCTGACG CGTCGTTTCG ACGACGTCAA GGCGGGTCGT
CCTGTGCAGT TGTTCAAGTC GCATCGGGGG GGCATCGCCG ACGGCGATCA GCGCCGGGAC
TTCATCTACG TCGACGACGT CGTCCGCGTG ATGATGTGGC TGCTGGCCAC GCCTTCCGTG
AGCGGCCTTT TCAATGTAGG AACCGGCAAG GCCCGTAGTT TTCGCGACCT GATGACGGCG
GCCTATGCTT CGCTCGGCGC AAGGCCGAAC ATCGAATATA TCGATATGCC CGAACAGATT
CGCGGCGCTT ACCAGTACTT TACGCAGGCC GATGTCGCCC GCTTGCAAGG CGCGGGCTAT
AACGGCGGCT TCACGCCTCT GGAAGAAGCC GTGGATGCCT ATGTCAAAGG CTATCTCGAT
CGCGACGATC GCTTTCGCTG A
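
A GC content figure like the one in the Gene Information section is normally the percentage of G and C bases in the sequence. The Python sketch below shows that calculation under that assumption. The helper name gc_content is hypothetical, and the example runs on only the first two 10-base blocks of the gene sequence, so its result is not the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence above, for illustration only.
first_blocks = "ATGCTGCTTG TAACCGGAGG"
print(gc_content(first_blocks))  # 55.0
```

Passing the full 981 bp sequence to the same function would give the whole-gene value to compare with the reported 62%.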
 
Protein sequence
MLLVTGGAGF IGSNLVAALN DAGRGDVVVC DALGHDGKWR NLAKRQLADV VPPAELTCWL 
DGRRLDAVFH LGAISETTAT DGDLVIETNF RLSLRLLDWC AGTATPFIYA SSASTYGDGA
QGFRDDQSLA ALRALRPMNL YGWSKHLFDM AVVGRAAQGG ALPPQWAGLK FFNVFGPNEY
HKGSMMSVLT RRFDDVKAGR PVQLFKSHRG GIADGDQRRD FIYVDDVVRV MMWLLATPSV
SGLFNVGTGK ARSFRDLMTA AYASLGARPN IEYIDMPEQI RGAYQYFTQA DVARLQGAGY
NGGFTPLEEA VDAYVKGYLD RDDRFR
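
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below is a minimal illustration, not the tool that produced this record. It relies on the fact that translation table 11 assigns codons to amino acids in the same way as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled, which is enough here because the gene begins with ATG. The names CODON_TABLE and translate are illustrative.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence above.
print(translate("ATGCTGCTTG TAACCGGAGG CGCCGGTTTT"))  # MLLVTGGAGF
print(translate("CGCGACGATC GCTTTCGCTG A"))           # RDDRFR
```

The two outputs match the first ten and last six residues of the protein sequence.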