Gene Nmul_A2413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2413 
Symbol 
ID3785505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2749415 
End bp2750524 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content56% 
IMG OID637812502 
Producttransglutaminase-like 
Protein accessionYP_413094 
Protein GI82703528 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.329453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGCA GGGATTTTAT CAAACTGGCG GGGGTAAGCG CCGCATTATT TCCCGTGGCG 
CCCTCAGTAT TCGGCCAGCA GTCTGGTCAA CCGGTTATTC CGCCGCGACG CTACACTTAT
CGTGTAACTT ATAACATCGA TCTTCCGGGG GATGGAAAAA AAGCACGTTT GTGGCTACCG
CTGCCGGATA CGGAGGATTC TCCCCACCAG TTTTCCCAGG GAAGTGTCTG GAGCGGGACC
GCAAGCACGG CCAGATTCGA GAACGTTCCT GGAACAACCT CACCCATGTT TTACGCTGAA
TGGAACCGTA GCGGCCCCCG CAGCGTAACG GTGAGCAGCG TGATCAAAAC ATCGGACCGC
GCTGTCAACC TGGAGCGCTA CAGGGAGGGT AATTCGAGCA CCCTTCCCGC GGATGTGAAA
CGTTATCTGC AGCCCACCAA ATTTATCCCG TTGGATGGCA TCGTCCGCAA AACTGCGCTA
TCCATTACCA AGGAGGCCAA AGCCCAATCG CAGCTGCAGA AAGCGCGCGC GATATATGAT
TGGGTAGTCG AAAATTCCTA TCGCGACCCG TCCACGCGCG GCTGTGGACG GGGTAATATC
AAAGCCATGC TGGAAACCGG TCATCTTGGC GGTAAATGCG CCGATCTGAA CGCATTGTTT
GTAGGACTGG CGCGCGCGGC GGGCATTCCG GCCAGGGACA ACTATGGAAT CCGCATTGAC
GAGTCTGCGG CGCATAAAAC GCTTGGCCAA GCCGACGATA TCACCACTGC CCAGCACTGC
CGCCCTGAAT TTTACCTGAC CGGCCTTGGC TGGGTCCCTG TTGATCCCGC GGATGTGCGG
CAACTGGCAC TGGATGAAGA ACTTCCCATT GAGCACCCAC GAGTGATCGA GCTACGTGAA
AAACTTTTTG GTTCATGGGA AATGAACTGG GTGGCATTCA ACCACGGCAG GGATATCAGG
CTGGCGCGAG ACAGCGTCCT GGGTGAACTG CCATTTTTCA TGTACCCTCA GGCCGAAGTA
GCGGGACACG AGCGGGACAG CCTTGAACCG GCGGAGTTTG CTTACAAGAT AACCTCAGCC
CGGCTGGTGG GTACGGGGAT CAAGTTTTAG
 
Protein sequence
MKRRDFIKLA GVSAALFPVA PSVFGQQSGQ PVIPPRRYTY RVTYNIDLPG DGKKARLWLP 
LPDTEDSPHQ FSQGSVWSGT ASTARFENVP GTTSPMFYAE WNRSGPRSVT VSSVIKTSDR
AVNLERYREG NSSTLPADVK RYLQPTKFIP LDGIVRKTAL SITKEAKAQS QLQKARAIYD
WVVENSYRDP STRGCGRGNI KAMLETGHLG GKCADLNALF VGLARAAGIP ARDNYGIRID
ESAAHKTLGQ ADDITTAQHC RPEFYLTGLG WVPVDPADVR QLALDEELPI EHPRVIELRE
KLFGSWEMNW VAFNHGRDIR LARDSVLGEL PFFMYPQAEV AGHERDSLEP AEFAYKITSA
RLVGTGIKF