Gene Nmul_A2462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2462 
Symbol 
ID3786419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2810976 
End bp2812091 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content59% 
IMG OID637812553 
Productrhomboid-like protein 
Protein accessionYP_413143 
Protein GI82703577 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTCAC TCCATGATCT TCTTCACCGG CAGGTTCCTC GAGTTCCGGT AACGAAGCTG 
CTGGTGTCGA CCAACCTGCT GATCTTTGTA GCCATGCTTG CCAGCGGAGC AGGCTTGTGG
CACTCATCCA ATGGCGTACA ACTCGCCTGG GGCGCCAACT TCGGTCCCGC TACCCAGGAT
GGGGAGTGGT GGCGCCTCGG GACCGCCATG TTCCTTCATT TTGGCCTGGT CCATCTCACC
TTGAACCTCT GGGCGCTCTG GGATGCAGGC CAACTGGTTG AGCGCATGTA CGGGCACGCG
CGCTTTACCG CCCTCTACTT TGCCAGCGGT CTTGCCGGCA ATCTGCTCTC GCTGGTTGCC
CATAAAGGCT TGGCCATTTC CGGCGGCGCT TCGGGCGCCA TTTTCGGCCT ATATGGCGCC
CTGCTGGTAT TTCTCTGGCG CGAGCGCGGC AGGCTGCATC CCCACGAGTT CCGATGGTTT
TTCTGGGGCG CCACGGCTTT TGCAATTGTC AGCCTTGGGC TGGGCCTCGC AATTACCGGT
ATCGATAACG CTGCTCATAT CGGCGGTTTC GTGACCGGTC TGCTCGGCGG AATAGTATTT
GCAAACCCAA GGATGAACGA AAAGCCTTCT CATGTATTCA GCAGCCGCCT TTCCGCTATA
AGCATTCTTG CACTGGCCGT CTTCATGCTG ATCGTTCGGA TTCCCCCTCC CGCCTATAAG
TGGAGCGAGG AAGTATTGGC GCGCAAGGAA ATCGGCAATT TTCTGCGCGA TGACCGGGCG
ATCACCCAGG CTTGGCAGCA TATACTCGAT GAGGCCAGGC GAGGAGGAAT CTCCTTTGAC
GAACTGGCGG GGCAAATCGA TACTGCGGTG GGTAATCCCT ATGAAGAAAG CTTCGAGCAG
CTTTCGGAAC TTCCCCCTGA TCCCGCATTG CCTTCTGCCG CTACGGTAGA AATGCTGCGA
GACTACGCCG AACGCCGGAG GGATGCGTCC CGCGCCCTTG CGGAAGGTCT GCGCACTCAC
AATCCCGCGC AAATCCGCCA CGCGCTGGAA ATGGCGAGGG AGCCGCTCCA GCTGCCCAAG
CTCTCCCCGC CAACCCCGTC CGCCCTACCC CGCTGA
 
Protein sequence
MLSLHDLLHR QVPRVPVTKL LVSTNLLIFV AMLASGAGLW HSSNGVQLAW GANFGPATQD 
GEWWRLGTAM FLHFGLVHLT LNLWALWDAG QLVERMYGHA RFTALYFASG LAGNLLSLVA
HKGLAISGGA SGAIFGLYGA LLVFLWRERG RLHPHEFRWF FWGATAFAIV SLGLGLAITG
IDNAAHIGGF VTGLLGGIVF ANPRMNEKPS HVFSSRLSAI SILALAVFML IVRIPPPAYK
WSEEVLARKE IGNFLRDDRA ITQAWQHILD EARRGGISFD ELAGQIDTAV GNPYEESFEQ
LSELPPDPAL PSAATVEMLR DYAERRRDAS RALAEGLRTH NPAQIRHALE MAREPLQLPK
LSPPTPSALP R