Gene Nham_2682 details


Gene Information

Locus tag: Nham_2682
Symbol:
ID: 4032380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 2949049
End bp: 2951013
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 64%
IMG OID: 637971134
Product: squalene cyclase
Protein accession: YP_577921
Protein GI: 92118192
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases
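
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information entries above.
start_bp, end_bp = 2949049, 2951013
gene_length_bp, protein_length_aa = 1965, 654

print(gene_span(start_bp, end_bp) == gene_length_bp)                  # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)   # True
```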


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.145002
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTCCG TAAACGCCAC AGTCGCGCCG ATCGACGATG CCGCTCTCGG GGGCAGCATC 
GGCGCCGCGA CGCGCGGGCT TCTGGACCTC AAGCAGCCGG ACGGTCATTT CGTGTTCGAG
CTGGAGGCCG ACGCGACCAT CCCGTCCGAA TACGTCCTCT TGCGGCACTA TCTCGGCGAG
CCGGTCGATG CGGCGCTGGA AGCCAAGATC GCGGTCTACC TCCGTCGCAT CCAGGGCGCA
CATGGCGGCT GGCCGCTGGT GCATGACGGC CCGTTCGACA TGAGCGCCAG CGTGAAAGCC
TACTTCGCGC TGAAGATGAT CGGCGATTCC ATCGACGCGC CGCACATGGC GCGCGCGCGC
GAGGCAATCC TCTCCCGCGG CGGTGCGGCC AACGTCAACG TCTTCACGCG CTTCCTGCTC
TCGCTCTTCG AAGTGCTGAC ATGGCGCAGC GCCCCCGTGC TGCCGATCGA GATCATGCTG
CTGCCGATGT GGTCGCCGTT CCATATCAAC AAGATTTCCT ACTGGGCTCG CACCACCATG
GTGCCGCTGA TGGTGTTGGC GGCGCTGAAG CCGCGCGCGC GCAATCCGCG CGGAATCGGC
ATCCGCGAAT TGTTTCTTCA GGATCCGGCC ACGGTCGGCA CGCCGAAGAG GGCTCCGCAT
CAAAGCCCGG CCTGGTTCAC GCTGTTCAAC AGCCTCGACT GGATCTTGCG CAAGATCGAA
CCGCTGTTTC CCAAACGGCT GCGTGCGCGC GCGATAGAAA AGGCGATCGC GTTCGTCGAG
GAGCGCCTCA ACGGCGAGGA CGGTCTCGGC GCGATCTTTC CGCCGATGGT CAATACGGTG
ATGATGTATG ACGCGCTGGG CTTTCCGCCC GAGCACCCGC CGCGCGCAGT GGCACGGCGC
GGAATCGACA AGCTTCTGGT GATCGGCAAG GATGAGGCCT ATTGCCAGCC CTGCGTGTCG
CCGATCTGGG ATACCGCGCT GACCTGTCAT GCGCTGCTCG AAGCTGGCGG ACCCGAGGCG
CTGAGTGGCG CGGGGAAGAG CCTCGACTGG CTGCTCCCGA AGCAGGAGCT CGTTCTCAAG
GGCGACTGGG CCGTGAAACG TCCGGACGTG CGGCCCGGCG GCTGGGCGTT CCAGTACGCC
AACGCCCACT ATCCCGATCT CGATGACACC GCTGTCGTGG TCATGGCGAT GGACCGGGTA
CGCCGCAACG ATCGCAGCGA TAAATACAAC GAGGCGATCG CGCGCGGCCG CGAGTGGATC
GAGGGCATGC AGAGCCGGGA CGGCGGCTTT GCGGCGTTCG ACGCCGACAA TCTTGAATAC
TATCTCAACA ACATCCCGTT CTCGGACCAT GCGGCGTTGC TCGATCCGCC GACCGAGGAT
GTCACGGCGC GATGCGTCTC GATGCTGGCG CAACTCGGCG AGACCGTTCG CAGCAGCCCG
TCCATGGCGG CCGGTGTGGA CTATCTGCGC CGGACCCAGC TCAAGGAGGG GTCGTGGTAC
GGCCGCTGGG GTCTCAACTA CATCTACGGC ACCTGGTCGG TGGTCTGTGC GCTCAATGCC
GCCGGGGTCG ATCACCAGGA TCCGGCGATG CGCAAGGCGG TGGACTGGCT GGTGTCGATC
CAGAATGCCG ATGGCGGCTG GGGTGAGGAC GCTGTCAGCT ACCGGCTCGA CTATAAGGGG
TTCGAGGGGG CGCCGACCAC GGCCTCGCAA ACGGCCTGGG CTTTGCTTGC CTTGATGGCC
GCGGGCGAGG TCGAAAATCC GGCGGTGGCC CGGGGGATGA AGTACCTAAT AGACACACAG
ACCAAAAAAG GGCTGTGGGA CGAGCAACGC TTCACCGCCA CGGGGTTTCC ACGGGTGTTT
TACCTGCGGT ATCATGGCTA CTCCAGATTC TTCCCGCTCT GGGCGCTGGC GCGGTACCGG
AATTTGAGAA GCACCAACAG CAAGGTGGTA GGGGTCGGGA TGTGA
 
Protein sequence
MNSVNATVAP IDDAALGGSI GAATRGLLDL KQPDGHFVFE LEADATIPSE YVLLRHYLGE 
PVDAALEAKI AVYLRRIQGA HGGWPLVHDG PFDMSASVKA YFALKMIGDS IDAPHMARAR
EAILSRGGAA NVNVFTRFLL SLFEVLTWRS APVLPIEIML LPMWSPFHIN KISYWARTTM
VPLMVLAALK PRARNPRGIG IRELFLQDPA TVGTPKRAPH QSPAWFTLFN SLDWILRKIE
PLFPKRLRAR AIEKAIAFVE ERLNGEDGLG AIFPPMVNTV MMYDALGFPP EHPPRAVARR
GIDKLLVIGK DEAYCQPCVS PIWDTALTCH ALLEAGGPEA LSGAGKSLDW LLPKQELVLK
GDWAVKRPDV RPGGWAFQYA NAHYPDLDDT AVVVMAMDRV RRNDRSDKYN EAIARGREWI
EGMQSRDGGF AAFDADNLEY YLNNIPFSDH AALLDPPTED VTARCVSMLA QLGETVRSSP
SMAAGVDYLR RTQLKEGSWY GRWGLNYIYG TWSVVCALNA AGVDHQDPAM RKAVDWLVSI
QNADGGWGED AVSYRLDYKG FEGAPTTASQ TAWALLALMA AGEVENPAVA RGMKYLIDTQ
TKKGLWDEQR FTATGFPRVF YLRYHGYSRF FPLWALARYR NLRSTNSKVV GVGM
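
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the code below shows one way to strip that layout, compute a GC fraction, and translate the DNA. It assumes the codon assignments of translation table 11, which match the standard genetic code (the tables differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are illustrative, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard code; translation
# table 11 uses the same codon-to-amino-acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above, truncated to whole codons.
first_blocks = "ATGAACTCCG TAAACGCCAC AGTCGCGCCG"
print(translate(first_blocks))  # MNSVNATVAP
```

Running `translate` and `gc_fraction` on the full gene sequence would allow a comparison against the listed protein sequence and GC content.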