Gene Nwi_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2079 
Symbol 
ID3675494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2278800 
End bp2280032 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content62% 
IMG OID637713645 
Producthypothetical protein 
Protein accessionYP_318689 
Protein GI75676268 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAT CTTCCGCCCA AGTCGTTCTC GGCGAGAGCT TGACGCCCAT CGGTCAATTG 
CGTTTCACGC AGACTGGGCC GCGTCAGTTT TCAACCTTCT CGTACGATCC GGCGTGGATC
AAGGATCCCC GCGCCTTCAC CTTGCAGCCC GACATGCCGT TTGAGGGCGG ACCATTTCAT
GCGTCGGCGC AGCCAGGCAA TCCGCGTGAC GCGCTCGCCG GCGCCTTCTC CGATGCGGCA
CCCGACAGCT GGGGCCGCAG ACTTCTGGAA CGCGCTTACG GCAAGGGTCT ATCCGAGTTC
GAGTACCTGA CCCTGTCCGA CGACACCTGC CGGCAGGGTG CGCTACGCTT TCTTGATGAT
CAGGGGAAGG TCATCACCGG CAAATCTGTC GAGGCGGTGC CGCGCTTGCT CGACCTGCAA
GCGATCACCG CGATCGCCCG CGCCTACGAG CAGGGCAAGG ATATTTCGGC CGAAGACATG
CAAGCCCTCG CCGGCGCGGG CGGCTCGGGT GGAGGCCGGC CCAAAGCCAA TGTCCGTGAC
GAACATGTGC TCTGGTTGGC GAAATTCACC TCCATCCACG ATCAGCACCC GATCGAGCAG
ATCGAGGTCG CCACTCTCAC CCTCGCCAAG GCGTGCGGCA TTCGGACGCC CGAAGTCAGG
TTGGAGCTTG CCGATACGCC CTTCCCTGTC GCGCTCATCC AGCGCTTCGA CCGACGCGGC
AGCGCGCGCA TCCCCTATAT CTCGGCACGA ACGGCGCTTG GGAAGACCGG CACCGAACTC
GGCTCGTATA CGGAGATCGT CGACTTCATG CGGACAGCCG CGTCCGATCC GAAAGCGGAT
TTCCAGGAGC TCTATCGGCG TCTGATCTTC ACCATCCTCG TCTCGAATAA GGATGATCAC
CTGAAGAACC ATGGATTTCT CTATGTCGGT TCTGGCCGCT GGCGTCTTTC GCCGGTCTTC
GATGTGAACC CGGCTCCCGA TCGCAACCCG CATTTGGAGA CAGCGATCAT GGAAGGCGGC
AGCCACGATC GATCGATCAA GCTGGCTTTG GACGCTTGTG AATTCTTCGA GATTCCCGAG
GCTGACGCGC GGCGAACGAT CCGGACCGCC GCCCAACGGA TATCCGACGG GTGGCGGGAT
GCCTTCAGAC AAGTCGGCGT GACCGGAGCG CGCGTCCGTG ATTACGAGCC GGCCTTTATC
AACGAACAGA CGGACATCGG ACTCGCCCTT TAA
 
Protein sequence
MTESSAQVVL GESLTPIGQL RFTQTGPRQF STFSYDPAWI KDPRAFTLQP DMPFEGGPFH 
ASAQPGNPRD ALAGAFSDAA PDSWGRRLLE RAYGKGLSEF EYLTLSDDTC RQGALRFLDD
QGKVITGKSV EAVPRLLDLQ AITAIARAYE QGKDISAEDM QALAGAGGSG GGRPKANVRD
EHVLWLAKFT SIHDQHPIEQ IEVATLTLAK ACGIRTPEVR LELADTPFPV ALIQRFDRRG
SARIPYISAR TALGKTGTEL GSYTEIVDFM RTAASDPKAD FQELYRRLIF TILVSNKDDH
LKNHGFLYVG SGRWRLSPVF DVNPAPDRNP HLETAIMEGG SHDRSIKLAL DACEFFEIPE
ADARRTIRTA AQRISDGWRD AFRQVGVTGA RVRDYEPAFI NEQTDIGLAL