Gene Nmul_A0752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0752 
Symbol 
ID3786487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp875181 
End bp876371 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content59% 
IMG OID637810837 
Productelongation factor Tu 
Protein accessionYP_411451 
Protein GI82701885 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0050] GTPases - translation elongation factors 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00485] translation elongation factor TU 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.993523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGA GCAAATTTGA GCGGACGAAG CCGCACATCA ACGTAGGGAC GATAGGTCAC 
GTGGACCATG GGAAGACCAC GTTGACGGCG GCGATCACGA TGGTATTGGC GAAGAAGTTT
GGTGGGGAAG CGAAGAGTTA CGCACAGATA GACTCGGCGC CTGAAGAGAA GGCGCGGGGC
ATCACGATCA ATACCTCGCA CGTGGAGTAC GAGACGGAGA AGCGGCATTA CGCGCACGTT
GACTGTCCTG GTCACGCGGA CTATGTGAAG AACATGATCA CGGGTGCGGC GCAGATGGAC
GGTGCGATTC TGGTGGTTTC GGCGGCGGAT GGACCGATGC CGCAGACGCG GGAGCACATT
CTTCTGGCGC GGCAGGTAGG GGTTCCCTAC ATTATTGTCT ACATGAACAA GGCGGACATG
GTGGACGATG CGGAACTTCT GGAGCTGGTG GAAATGGAAG TGCGGGAGCT GTTGTCCAAA
TACAACTTTC CGGGAGATGA CACCCCGATA GTGATCGGTT CTGCACTGAA GGCGCTGGAA
GGCGATCAGA GCGACATAGG GGAGCCCTCC ATCTACAAGC TTGCGGCGGC GCTGGACAGC
TACATTCCGG AGCCCCAGCG GGCGGTGGAC GGGGCATTTC TGATGCCGGT CGAAGACGTT
TTTTCCATAT CGGGTCGTGG CACGGTGGTG ACGGGTCGGG TTGAGCGTGG CGTGATCAAG
GTGGGGGAAG ACATCGAGAT CGTGGGATTG AAGCCCACCA CCAAGACGGT GTGCACGGGT
GTGGAGATGT TTCGCAAGCT TCTGGACCAG GGGCAGGCGG GAGACAACGT GGGCGTATTG
CTGCGGGGCA CCAAGCGCGA GGAAGTGGAG CGTGGCCAGG TGCTGGCCAA GCCCGGGACC
ATCACTCCTC ATACCAAGTT CACAGCCGAG ATTTACGTTC TGAGCAAGGA AGAGGGCGGG
CGTCATACTC CCTTTTTCCA GGGGTACCGG CCGCAGTTTT ACTTCCGCAC GACGGATGTG
ACGGGTGCAA TCGAGTTGCC TGCGGGCACG GAGATGGTGA TGCCCGGGGA CAATGTGTCG
GTGACGGTAA ACCTGATTGC GCCGATTGCG ATGGAAGAAG GTCTGCGTTT TGCGATTCGT
GAAGGCGGCA GGACCGTGGG CGCAGGCGTC GTGGCAAAAA TTATCGAATA G
 
Protein sequence
MAKSKFERTK PHINVGTIGH VDHGKTTLTA AITMVLAKKF GGEAKSYAQI DSAPEEKARG 
ITINTSHVEY ETEKRHYAHV DCPGHADYVK NMITGAAQMD GAILVVSAAD GPMPQTREHI
LLARQVGVPY IIVYMNKADM VDDAELLELV EMEVRELLSK YNFPGDDTPI VIGSALKALE
GDQSDIGEPS IYKLAAALDS YIPEPQRAVD GAFLMPVEDV FSISGRGTVV TGRVERGVIK
VGEDIEIVGL KPTTKTVCTG VEMFRKLLDQ GQAGDNVGVL LRGTKREEVE RGQVLAKPGT
ITPHTKFTAE IYVLSKEEGG RHTPFFQGYR PQFYFRTTDV TGAIELPAGT EMVMPGDNVS
VTVNLIAPIA MEEGLRFAIR EGGRTVGAGV VAKIIE