Gene Nwi_2269 details

Gene Information

Locus tag: Nwi_2269
Symbol:
ID: 3675174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 2472851
End bp: 2474815
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 63%
IMG OID: 637713832
Product: terpene synthase/squalene cyclase
Protein accession: YP_318875
Protein GI: 75676454
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
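
The coordinate and length fields above are consistent with one another, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2472851
end_bp = 2474815

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon (3 bp) is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1965 654
```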


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.178212
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTCCA TCAACGCCAC AGCCGCGCCG ATCGACGATA ATGTTCTCGG GGACCGTATC 
GGCGCCGCGA CGCGCGGGCT TCTAAGCCTC AAGCAGTCGG ACGGTCATTT TGTGTTCGAA
CTCGAGGCCG ACGCAACCAT CCCGTCCGAA TACATCCTGA TGCGGCATTA CCTCGGCGAA
CCCGTCGATA CGGTGCTCGA AGCCAAGATC GCGGCTTACC TCCGCCGCAT CCAGGGCGCG
CATGGCGGCT GGCCGCTGGT GCATGACGGC CCGTTCGACA TGAGCGCCAG CGTGAAGGCC
TATTTCGCGC TGAAAATGGC CGGCGATTCC ATCGACGCGC CACACATGGC GCGGGCGCGC
GAGGCCATCC TGTCCCGAGG CGGCGCGGCG AATGTGAACG TCTTCACGCG CTTTCTGCTC
TCGTTTTTCG GCGAACTGAC GTGGCGCAGC GTTCCGGTGC TGCCGGTCGA GATCATGCTG
CTGCCGATGT GGTCGCCGTT CCATCTCAAC AAGGTCTCCT ACTGGGCGCG CACCACCATG
GTGCCGCTGA TGGTGCTGGC CGCGCTGAAG CCGCGCGCGC GCAATCCGCG CGGCATCGGC
ATCCGCGAAC TTTTTCTTGA GGATCCGGCG ACGGTAGGCA CGCCGAAGAG GGCTCCGCAC
CAAAGTCCGG GCTGGTTCGC GCTGTTTACC GGCTTTGACC GGGTCTTGCG GCTGATAGAA
CCGCTGTCTC CCAAGTGGCT GCGGGCGCGC GCCATGAAAA AGGCGATCGC GTTTGTCGAG
GAGCGCCTCA ACGGAGAGGA CGGCCTCGGC GCGATCTTTC CGCCGATGGT CAATACGGTG
ATGATGTATG ACGCGTTGGG ATTCCCACCG GAGCATCCGC CGCGCGCGGT GACGCGACGC
GGCATCGACA AGCTTCTGGT CGTCGGCGAA AATGAGGCCT ACTGCCAGCC ATGCGTGTCG
CCGATCTGGG ATACCGCGCT GAGCTGTCAC GCGCTGCTCG AAGCGGGCGG ACCTGAGGCC
GTAAACAGCG CCGGCAAATG CCTCGATTGG CTACTTCTGA AACAGGAACT GGTTCTCAAG
GGCGACTGGG CGGTGAAACG TCCGGACGTG CGGCCGGGCG GCTGGGCGTT TCAATACGCC
AACGGCCACT ATCCCGATCT CGACGATACC GCTGTCGTGG TCATGGCGAT GGATCGGGTG
CGCCGGAACG GCCCGAATGG TCGATACGAC GAAGCGATCG CGCGTGGTCG TGAGTGGATC
GAGGGGATGC AGAGCCGGGA CGGCGGCTTT GCCGCGTTCG ATGCCGACAA TCTTGAATAC
TACCTTAACA ACATCCCGTT CTCCGACCAT GCCGCTCTGC TCGATCCGCC GACCGAGGAT
GTCACCGCGC GGTGCGTCTC GATGCTGGCG CAACTCGGCG AGACCGTGGA CAGCAGCTCG
TCCATGGCGG CGGGAGTCGA GTATCTGCGC CGGACCCAGC TCGCGGAGGG TTCGTGGTAC
GGCCGCTGGG GCCTGAACTA CATCTACGGC ACCTGGTCAG TGCTCTGCGC GCTCAACGTC
GCCGGGGTCG ATCACCAGGA TCCCGTGATC CGCCGGGCGG TGAACTGGCT GGTGTCGATC
CAGAATGCCG ATGGCGGCTG GGGCGAGGAT GCGGTCAGCT ACCGACTCGA CTATAAGGGA
TTCGAGGGAG CGCCGACCAC GGCTTCGCAG ACGGCCTGGG CGTTGCTGGC CTTGATGGCG
GCGGGCGAGG TCGAAAATCC TGCGGTGGCG AGGGGAATCA AGTACCTGAT AGACACACAA
ACAAAAAAAG GTCTGTGGGA CGAGCAGCGC TATACGGCCA CGGGCTTTCC ACGCGTATTT
TATCTGAGGT ACCATGGCTA CTCCAAGTTC TTCCCGCTCT GGGCGCTGGC GCGGTATCGG
AATTTGAGAA GCACCAATAG CAAGGCGGTA GGGGTCGGGA TGTGA
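
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is analysed. As a rough sketch, the hypothetical helper below (`gc_fraction` is not part of any database tool) strips the whitespace and returns the fraction of G and C bases. Applied to the full sequence, it is the kind of calculation that a GC content field is usually based on, although the exact method used for this record is not stated.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G/C bases in a whitespace-blocked DNA string."""
    # Join the 10-base blocks and lines into one uninterrupted string.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above: 3 of 10 bases are G or C.
print(gc_fraction("ATGAATTCCA"))  # prints: 0.3
```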
 
Protein sequence
MNSINATAAP IDDNVLGDRI GAATRGLLSL KQSDGHFVFE LEADATIPSE YILMRHYLGE 
PVDTVLEAKI AAYLRRIQGA HGGWPLVHDG PFDMSASVKA YFALKMAGDS IDAPHMARAR
EAILSRGGAA NVNVFTRFLL SFFGELTWRS VPVLPVEIML LPMWSPFHLN KVSYWARTTM
VPLMVLAALK PRARNPRGIG IRELFLEDPA TVGTPKRAPH QSPGWFALFT GFDRVLRLIE
PLSPKWLRAR AMKKAIAFVE ERLNGEDGLG AIFPPMVNTV MMYDALGFPP EHPPRAVTRR
GIDKLLVVGE NEAYCQPCVS PIWDTALSCH ALLEAGGPEA VNSAGKCLDW LLLKQELVLK
GDWAVKRPDV RPGGWAFQYA NGHYPDLDDT AVVVMAMDRV RRNGPNGRYD EAIARGREWI
EGMQSRDGGF AAFDADNLEY YLNNIPFSDH AALLDPPTED VTARCVSMLA QLGETVDSSS
SMAAGVEYLR RTQLAEGSWY GRWGLNYIYG TWSVLCALNV AGVDHQDPVI RRAVNWLVSI
QNADGGWGED AVSYRLDYKG FEGAPTTASQ TAWALLALMA AGEVENPAVA RGIKYLIDTQ
TKKGLWDEQR YTATGFPRVF YLRYHGYSKF FPLWALARYR NLRSTNSKAV GVGM
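
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal, dependency-free illustration of that step. It assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the tables differ in which codons may act as start codons), and it does not handle alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a whitespace-blocked DNA string, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAATTCCA TCAACGCCAC AGCCGCGCCG"))  # prints: MNSINATAAP
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GGGGTCGGGA TGTGA"))  # prints: GVGM
```

Both results match the beginning and end of the protein sequence listed above.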