Gene Nmul_A0450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0450 
Symbol 
ID3785918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp499228 
End bp500937 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content54% 
IMG OID637810526 
Productprolyl-tRNA synthetase 
Protein accessionYP_411150 
Protein GI82701584 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.543628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGCAT CAGGTTTTTT CATTTCGACA CTCAAGGAAG CCCCCGCCGA AGCGGAATTG 
ATCAGCCACA AGCTGATGCT GAGGGCCGGT ATCATCCGGC GGCTGGGGAG CGGTCTCTAC
ACGTGGATGC CCCTCGGCTT GAAGGTTCTG CGCAAAGTTG AAAACATCGT GCGGGAGGAA
ATGGATGCCG CGGGCGCACT GGAACTGCTG ATGCCGGCTG TGCAGCCTGC GGAGCTATGG
CGGGAAACAG GGCGATGGGA CGTCTTTGGT CCCCAGATGT TGAAAATTAG AGACAGACAC
GAGCGCGATT TCTGTTTTGG TCCAACCCAT GAAGAGGTCA TAACGGATAT TGCACGGCGT
GAAATCAAGA GTTACCGGCA GTTACCTCTC AATTTTTATC AGATACAAAC CAAATTTCGC
GACGAGGTTC GCCCCCGCTT CGGCGTCATG CGCGCGCGCG AGTTCGTGAT GAAAGATGCC
TATTCGTTCC ACACCGATAT ACCCAGCCTG GAAGAGACCT ATCAGGCCAT GCATGTGGCC
TATTGCCGGA TATTCGATCG CCTGGGGCTG AAGTTTCGTC CCGTCAAGGC CGATACGGGT
GCGATTGGTG GCAGCAGTTC GCATGAATTT CATGTCCTGG CCGATTCCGG CGAAGACGCC
ATCGCTTTTT GTTCCGATTC CGATTACGCA GCGAACGTTG AAATGGCCGA GTCGTTGCCG
CCAGCAGGAC TGCGGGAGGC TGCGGCGGGC GAGATGCAGA AAGTGCGAAC AATCGCCCAA
AAGACATGCG AAGAGGTTGC TGCCTATCTC AATGTATCCA TCGAGCAGAC GGTAAAAACG
CTGGCGGTCA TGGCCAATGG CGGAATGCAT CTTTTGCTGC TGCGTGGCGA TCATCATCTC
AACGAGACAA AAGTTCGAAA GATTCCTTTT CTTTCCGATT TCCGGCTTGC CAGCGAAGAA
GAAATTCGCA CCGAAACAGG ATGTCTTCCC GGGTTTATCG GCCCAGCCGG ATTGTCTCTT
CCGGTTATCG CAGACCTTAC CGTAGCCACC ATGAGTAACT TTGTGTGCGG CGCCAACGAA
GAGGATTACC ATCTCGTCAA CGTCAATTTC GGGCGTGATC TGAAAGAGCC GGATCATGTT
TTCGATATCC GCAACGTGGT TTCCGGTGAC TTGTCGCCGG ACGGAAAGGG TAAGCTGGAA
ATCTGTCGCG GCATAGAAGT CGGCCATATT TTCCAATTGC TCACAAAGTA TTCGGAAGCG
ATGAAAGCCA ATTATCTCGA TGAATCCGGG CAAGCGCGTC CCATGGAAAT GGGCTGCTAC
GGGATCGGGG TTTCACGTAT TGTGGCAGCT GCCATCGAGC AGAACCATGA CGAGCGCGGC
ATTATATTTC CCGCGGCAAT GGCGCCATTT CAGGTAGTCA TCATTCCGAT CGGGTTGAAG
AAGAATGCAG AAGTGAGGGC TGAGGCGGAG AAACTATACG CGACGCTTTC CAGTGTCGGC
ATCGAGGTTC TGCTCGACGA CCGGGATGAC CGCCCCGGTG TCATGTTCGC CGACATGGAA
CTGATCGGTA TTCCTCACCG GGTTGTCGTC GGCGAGCGGG GCTTGAAGGA AGGAAATGCC
GAGTATCGCG GGCGGCGTGA CGAAAAATCG GAGGTCGTCC CCCTTCCCGA GATCGCAGAT
TTTATAAAAT CAAAATTAGC CGGGGGTTGA
 
Protein sequence
MRASGFFIST LKEAPAEAEL ISHKLMLRAG IIRRLGSGLY TWMPLGLKVL RKVENIVREE 
MDAAGALELL MPAVQPAELW RETGRWDVFG PQMLKIRDRH ERDFCFGPTH EEVITDIARR
EIKSYRQLPL NFYQIQTKFR DEVRPRFGVM RAREFVMKDA YSFHTDIPSL EETYQAMHVA
YCRIFDRLGL KFRPVKADTG AIGGSSSHEF HVLADSGEDA IAFCSDSDYA ANVEMAESLP
PAGLREAAAG EMQKVRTIAQ KTCEEVAAYL NVSIEQTVKT LAVMANGGMH LLLLRGDHHL
NETKVRKIPF LSDFRLASEE EIRTETGCLP GFIGPAGLSL PVIADLTVAT MSNFVCGANE
EDYHLVNVNF GRDLKEPDHV FDIRNVVSGD LSPDGKGKLE ICRGIEVGHI FQLLTKYSEA
MKANYLDESG QARPMEMGCY GIGVSRIVAA AIEQNHDERG IIFPAAMAPF QVVIIPIGLK
KNAEVRAEAE KLYATLSSVG IEVLLDDRDD RPGVMFADME LIGIPHRVVV GERGLKEGNA
EYRGRRDEKS EVVPLPEIAD FIKSKLAGG