Gene Nmul_A0030 details


Gene Information

Locus tag: Nmul_A0030
Symbol:
ID: 3784019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 30847
End bp: 32124
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 63%
IMG OID: 637810099
Product: hypothetical protein
Protein accession: YP_410731
Protein GI: 82701165
COG category: [S] Function unknown
COG ID: [COG2733] Predicted membrane protein
TIGRFAM ID:
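
The coordinate and length fields above agree with each other: the span from 30847 to 32124, counted inclusively, is 1278 bp, which is 426 codons, and dropping the terminal stop codon leaves 425 residues. A minimal Python sketch of that arithmetic follows. The function name `cds_lengths` is hypothetical, and the sketch assumes 1-based inclusive coordinates and an unspliced CDS that ends in a stop codon.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


print(cds_lengths(30847, 32124))  # (1278, 425)
```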


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.29018
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCCTA TCGAATCGTC AGCCTCGAGT CGCCACACGA GCTCTCTGAA GCGATCACTT 
GCCGATAACC GGATGGGGCG TCTGGCGACG GCGCTGCTGA TGGTGATGCT TGCGGTTCTC
GTTCTGACCA ACATATTTCT TCCGGTTCAT CCCGCGATGG GCTACGTTCG CGCTTTTGCC
GAAGCTGCGG TAGTGGGCGC GCTGGCGGAC TGGTTTGCCA TAACGGCGTT ATTCCGCCAG
CCCCTCGGCC TGCCCATTCC CCACACTGCA ATCATTCCGC GTAACAAGGA TCGCATCGGG
GAATCGCTGG GACGCTTCGT GGAGAGCAAC TTTGCTTCTC CCGAAGTGGT TGCCGCCAAG
CTTGCGCCTG TGGATTTGTC CGGGAAACTG GCAACGTGGC TGTGCGGGGA GGCGCGTACC
GACCTGCTGG CGGATTATGT GACGCACCTG ATTCCGGAAT TGCTGGATTC GGTGGACGAG
CGCCATGTGC AGCATTTCGT TTCGGCCGGG GTGCTGGAAA AAGCGGGGCG CATCGATCTT
GGCCCTTTGC TCGGGGAGGC GGTGAGGATG CTCACTGCGG AAAAGCGGCA CCAGCGGCTG
CTGGACAAGC TGTTGCGCGA GGCTGATGAA TATGTGACCG CGAACGAATC CCGTATCCGT
CAGCGGGTGC GCGAAAACAC AGCCTGGTTC TGGCAGCGGC TTTCGATGGA TGAGAAGGTG
GGGGAAAGCG TGGTGGCAGC CCTGCGCGAG GTGGTGGCGG AGATCGCGCG CGACCCTGCC
CACCCCCTGC GCTTGCGACT GGATGCTGCC ATCGGCAAGC TTGCCTCCGA CCTGGCTACT
TCACCCGAGT ATCGCGAGCA GGTTGCCGCC CACACCCGCA AGCTGCTGGA GCATCCGGCC
TTGCGGGACT ACGCGGACGG AGTCTGGCGC GACCTCCGCA ACGGGATGCG CGAGGACATC
GACAGCGAGG ACTCGGCAAT CAGGGGGTGG ATGCGGGGCC TCATACAGTC GGGCACCGAT
ACTGTACTTG AGGACCGTGG TTTGCGGGAG CGGCTCAATA ACTGGATGCG GGAGGTGCTG
GTGGAAGCGG TGCAGTCTCA CCAGCGCGAT GTGGGCAGGC TGATTGCCGA CACCGTGCGG
GAGTGGGACA CGCAGACAGT GACGCACCGC ATCGAGCGGC AGGTGGGCGA GGACCTGCAG
TACATCCGCA TCAATGGCAC GCTGATAGGC GGACTGGCAG GCCTGGCGAT CTACACCATC
GCCCACCTGT TCGCTTGA
 
Protein sequence
MQPIESSASS RHTSSLKRSL ADNRMGRLAT ALLMVMLAVL VLTNIFLPVH PAMGYVRAFA 
EAAVVGALAD WFAITALFRQ PLGLPIPHTA IIPRNKDRIG ESLGRFVESN FASPEVVAAK
LAPVDLSGKL ATWLCGEART DLLADYVTHL IPELLDSVDE RHVQHFVSAG VLEKAGRIDL
GPLLGEAVRM LTAEKRHQRL LDKLLREADE YVTANESRIR QRVRENTAWF WQRLSMDEKV
GESVVAALRE VVAEIARDPA HPLRLRLDAA IGKLASDLAT SPEYREQVAA HTRKLLEHPA
LRDYADGVWR DLRNGMREDI DSEDSAIRGW MRGLIQSGTD TVLEDRGLRE RLNNWMREVL
VEAVQSHQRD VGRLIADTVR EWDTQTVTHR IERQVGEDLQ YIRINGTLIG GLAGLAIYTI
AHLFA
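
The protein sequence appears to be the translation of the gene sequence under translation table 11. The Python sketch below shows one way to check that, and to compute a GC percentage, from the space-separated blocks as printed. The names `clean`, `translate`, and `gc_percent` are hypothetical helpers, not part of any database tool. The sketch relies on table 11 sharing the standard code's codon-to-amino-acid assignments (it differs in which codons may act as start codons), and it assumes the sequence is given in coding orientation and contains only A, C, G, and T.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCAGCCTA TCGAATCGTC AGCCTCGAGT"))  # MQPIESSASS
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, after `clean` has been applied to both, gives an end-to-end consistency check of the record.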