Gene Nmul_A2556 details


Gene Information

Locus tag: Nmul_A2556
Symbol:
ID: 3786282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2924736
End bp: 2926697
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 58%
IMG OID: 637812647
Product: squalene cyclase
Protein accession: YP_413237
Protein GI: 82703671
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases
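
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the protein length excludes the terminal stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 2924736
end_bp = 2926697
gene_length_bp = 1962
protein_length_aa = 653

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length is consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks hold for this record: 2926697 - 2924736 + 1 = 1962 bp, which is 654 codons, or 653 amino acids plus one stop codon.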


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.511022
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TCGGAGGTAT GGCGAGAACT TCATTGCAAG CTCAATCCCC GGGCTCAAAC 
AATACTCCCT CAATGGACGA AAAAATGCTG AAAGCGGGAC TGGAAGCGGC ACGCGGTGCG
CTGCTGGCTC AGCAGAGGGA GGATGGCCAC TGGTGCTTTC CATTGGAGGC CGATTGCACG
ATTCCCGCCG AGTATATACT GATGATGCAC TTCATGGATG AAGTGGACCT GGATCTGGAG
GTCAGGATTG CCCGTTTTAT TCGTGAAAAA CAGGATGTAG CGCACGGAGG CTGGCCGCTC
TACTACGGCG GTGAATTCGA TCTGAGTTGT TCGGTCAAGG CCTATTATGC TCTGAAAATT
GTGGGGGACT CCCCGGACGC GCCCCACATG GTGCGTGCGC GTGCAGCAAT TCTGAAACAT
GGCGGCGCAG CGAGAGCAAA TGTATTTACA CGCCTGCTGC TTGCCATGTA TGACCAGCTT
CCCTGGCGAG GCGTGCCATT TGTGCCGGTG GAAATCATAC TGTTCCCCAA GTGGTTTCCA
TTCCATACCA GCAAGGTCGC ATACTGGTCG CGGACCGTAA TGGTTCCGCT ATCTATCCTG
TGCAGCCTGA AGGCGCGCGC GGCGAATCCG CGCAAAGTCG CCATTCGTGA ATTGTTTACG
GTACCTCCGG GGGAGGAGCG GAATTATTTT CCGGTACGTA CTGCGCTCAA TCGCGTGTTT
CTGCTGATCG AGCGCACTCT CTCCCTGCTC GAACCCTTCA TACCCCAAGG GGTGCGCAGA
TTGGCGCTGC GGCGGGCGGA AAGCTGGATC GTGGAAAGGC TGAACGGCGA CTCAGGCCTG
GGGGCGATCT TTCCTGCCAT GGTGAATGCC GGCGAAGCTC TGGCATTGCT TGGATACCCA
TATGATCATC CCGCGCGGGA GCAATGCCGC AAGGCGCTGC GCCTGCTGCT GGTGGAGGAA
GGCGAGCGTA CCTGGTGCCA GCCCTGTGTT TCCCCAGTAT GGGATACCGT CCTGACCTGT
CTCGCCTTTC AGGAGGATAC GGAGGTCGAT CAGAAGCCCA TCCGGAAAGC GCTCGACTGG
CTGGTTCCCT GTCAGGTGCT CGATGCGCCG GCGGACTGGC AAGAAGATCA TCCCGGATTA
CCGGGCGGGG GCTGGGCTTT TCAGTATGCA AATCCCCACT ATCCCGATCT CGACGACACG
GCTGCGGTGG CGTGGGCACT GTACCAGGCC GACCCTAAGG CTTATCAGGA AAGCATCAGC
CGGGCCGCCG ACTGGCTGGC GGGGATGCAG TCCAGCAATG GGGGGTTTGC GGCTTTCGAC
AGCGACAATA CGTATTACTA TCTCAACGAG ATCCCGTTTG CCGATCATGG GGCGCTCCTT
GATCCTCCCA CCAGTGATGT TTCCGCCCGC TGTGCAGGTT TTCTCGCCTT GTACGGCCAG
TCCAGGCATA AACAGGCGCT GGAACGGAGT CTGGCATACC TCTTCAATGA ACAGGAGGCC
AGTGGCGCCT GGTTCGGCCG CTGGGGCAGC AATTACATCT ATGGAACGTG GTCTGTGCTG
GAGGCCTTCC GTCTGGCCGG GATAGATGCC GGTCATCCTG CCATCCGACG CGCGGTGCAC
TGGCTCAAAT CCGTGCAGCG GGAGGATGGC GGATGGGGAG AAAGTAACGA CAGCTATCTT
TCCCCCCAGC AGGCAGGCCA GTTTCATACC AGTACTTCTT TTCATACCGC ATGGGCACTG
CTTGCACTGA TGGGAGCCGG CGAATGGCGA AGTCATGAGG TTCACCGGGG AATTGCCTAC
CTCTTGCGGG AACAGGACAG CGACGGGCTC TGGCATGAGC CCTGGTTTAC CGCTCCCGGC
TTCCCGCGGG TTTTCTACCT CAAGTACTAC GGGTATACGA AATATTTCCC GGTATGGGCA
TTGACCCGAT TTCATGCATT GAACCGGAAG TTCCCCGGGT GA
 
Protein sequence
MKKFGGMART SLQAQSPGSN NTPSMDEKML KAGLEAARGA LLAQQREDGH WCFPLEADCT 
IPAEYILMMH FMDEVDLDLE VRIARFIREK QDVAHGGWPL YYGGEFDLSC SVKAYYALKI
VGDSPDAPHM VRARAAILKH GGAARANVFT RLLLAMYDQL PWRGVPFVPV EIILFPKWFP
FHTSKVAYWS RTVMVPLSIL CSLKARAANP RKVAIRELFT VPPGEERNYF PVRTALNRVF
LLIERTLSLL EPFIPQGVRR LALRRAESWI VERLNGDSGL GAIFPAMVNA GEALALLGYP
YDHPAREQCR KALRLLLVEE GERTWCQPCV SPVWDTVLTC LAFQEDTEVD QKPIRKALDW
LVPCQVLDAP ADWQEDHPGL PGGGWAFQYA NPHYPDLDDT AAVAWALYQA DPKAYQESIS
RAADWLAGMQ SSNGGFAAFD SDNTYYYLNE IPFADHGALL DPPTSDVSAR CAGFLALYGQ
SRHKQALERS LAYLFNEQEA SGAWFGRWGS NYIYGTWSVL EAFRLAGIDA GHPAIRRAVH
WLKSVQREDG GWGESNDSYL SPQQAGQFHT STSFHTAWAL LALMGAGEWR SHEVHRGIAY
LLREQDSDGL WHEPWFTAPG FPRVFYLKYY GYTKYFPVWA LTRFHALNRK FPG
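
Both sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before they can be used elsewhere. The following is a minimal sketch of that cleanup, together with a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its GC content value was computed; the sketch assumes the usual definition, (G + C) divided by total length.

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    gc_count = dna.count("G") + dna.count("C")
    return gc_count / len(dna)


# Usage with a short made-up fragment, not the gene above.
example = clean_sequence("ATGC ATGC\nGGCC")
print(example)
print(gc_fraction(example))
```

To apply this to the record, paste the full gene sequence text into `clean_sequence` and pass the result to `gc_fraction`. The cleaned length can then be compared with the Gene Length field.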