Gene Nmul_A1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1914 
Symbol 
ID3784152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2204065 
End bp2205120 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content56% 
IMG OID637812000 
ProducttRNA pseudouridine synthase A 
Protein accessionYP_412601 
Protein GI82703035 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0101] Pseudouridylate synthase 
TIGRFAM ID[TIGR00071] pseudouridylate synthase I 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTGTG CGAAAGCGGT GGGCAATGAC GGACTACCCA ATCAGCGCCC AATGTCACGC 
TGGAGAGCGT CTGCTTATAC GTCATTACTT CCGTTTCCCG AAAAGTGTTT AAAAATTCCC
TGGACGTGCC GGTACAAGTA TCACCAGAAT CACGATGGTA GCTTTACCGT GAGAATCGCA
ATGGTGCTGG AGTATGACGG CAGCAATTTC TGCGGCTGGC AAAGCCAGCC AGGGGGAAAT
ACCGTGCAGG ATGCCGTGGA AGCGGCCTTG TCTGAAATTG CAGGCGAGGC TATCCGAGTA
GTGACGGCAG GGAGAACCGA CGCAGGGGTT CATGCGATCT ACCAGGTGCT GCATTTCGAT
ACTCGGGCGG AGCGACCTAT GAATGCATGG GTGCGGGGTG CAAACGCCCT GCTGCCCAGC
GGCATTGCCC TGCTATGGGC ATCCCCTACT GCAGACGATT TTCATGCCCG CTACTGTGCG
CTTGAGCGTT GTTACCTCTA CCTGTTACTG AACCACCCAG TGCGGCCGGG CCTTCATCAG
CACCGAGTCG GCTGGTATCA CCATCCGCTC CGTCTCGAAT CCATGCAGAT GGGGGCACAA
ATGCTGGTGG GCGAACACGA TTTCAGCGCC TTTCGGGCTG CTGCATGCCA GGCCAAATCC
CCTGTACGCA CCCTGACAAA ACTGGAAGTT ACGCGAGTGG GAAACATGGT TGCGTTTGAG
CTGCGCGCTA ATGCATTTTT GCACCACATG GTACGGAATA TCGTCGGTTG TCTGGTCTAT
GTAGGTAAAG GTAAATTTCG TCCTGACTGG ATAGGGAAAC TGCTTGAAAA CGGGAAGCGC
AGCGAAGCTG CACCGACTTT TTCCGCTTCC GGGTTATACT TGGCAGGGGT TGCCTATGAT
GCGAGGTGGA AGCTGCCACC CTTTGTCGAG CCCCCTCTGA CCGCAATAGT GCCGGGCACA
AACAGGCCGG CTATCCTCAC ATCATGGGCG ACAAGTGGCG GAAACCCAGT TGCGGGCGCA
ACACCGGAAG TCAGGGATAT ATGTCGATTC GAGTAA
 
Protein sequence
MACAKAVGND GLPNQRPMSR WRASAYTSLL PFPEKCLKIP WTCRYKYHQN HDGSFTVRIA 
MVLEYDGSNF CGWQSQPGGN TVQDAVEAAL SEIAGEAIRV VTAGRTDAGV HAIYQVLHFD
TRAERPMNAW VRGANALLPS GIALLWASPT ADDFHARYCA LERCYLYLLL NHPVRPGLHQ
HRVGWYHHPL RLESMQMGAQ MLVGEHDFSA FRAAACQAKS PVRTLTKLEV TRVGNMVAFE
LRANAFLHHM VRNIVGCLVY VGKGKFRPDW IGKLLENGKR SEAAPTFSAS GLYLAGVAYD
ARWKLPPFVE PPLTAIVPGT NRPAILTSWA TSGGNPVAGA TPEVRDICRF E