Gene Nmul_A0376 details


Gene Information

Locus tag: Nmul_A0376
Symbol:
ID: 3784071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 410657
End bp: 413446
Gene Length: 2790 bp
Protein Length: 929 aa
Translation table: 11
GC content: 53%
IMG OID: 637810452
Product: tetratricopeptide TPR_4
Protein accession: YP_411076
Protein GI: 82701510
COG category: [N] Cell motility
              [R] General function prediction only
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
        [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
            [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein
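
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions agree with the values listed here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(410657, 413446)
print(length, protein_length(length))  # prints: 2790 929
```

The results match the Gene Length (2790 bp) and Protein Length (929 aa) fields.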


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGCTT TTAGAAAGAC TGTTTTAAAA AAGACTGGGC TGACGCTTGC AATCGCTGCC 
GCCCTCACCG TGAGCAGCAT GCAAGGCTGT AACAAAGCGG TGGATCCGGT GAAACTGATG
GCGGATGCGA AGCGCTACGA AGAAAGCGGG GATCACAAGT CAGCCATCAT CCAGCTAAAA
AATGCCTTGC AGCAGAACCC GGATAATTCG GAAGCGCGAT ATCTGCTTGG CGCAATATAC
AACAAGACTG GGGATTTTCA GTCCGCGGAG AAGGAGCTTC GCAAGGCGTT GAGCTTGGGA
ATGGATGCGG GCAAGGTGCT GCCTGAACTC GGACAAGCCC TGCTCAGGCT AGGCGCCTAT
CAACAAATCC TGGATGAAAC GAAAGAACTG GCCGACAAAA CAAAGTCGGC GCAGATACTT
ACGCTCCGGG GGAATGCACA GCTCGGATTG GGCAAGACCG CGGAAGCAAA GGCACTGTTC
GAGCAAGCGC TCGGACACAA TCCCGATTCT GCCGACGCAC TGATCGGCTT GGCGAAATAT
TCCCTGGTTC AAAGGGATGT GGAAGGCGCA ACCCATTTCT CGGAGCAGGC GGTTTCCCGG
AGCCCACAAA ATGTCGAGGC CTGGTTGTTC AGGGGCGACC TTCTGCGGAT GCAAGGCAAA
TCCGGGGAAG CTCTGGCGGC GTATGACCAG GTGGTAAAGC TGAAACCGAA CGCAGCCATT
GCCTATATCA ACAAAGCATT CATCGAAATT GGAACGGGCA AGTTCGAGGC GGCAAAGGCA
GATATCGACG CGGCACGGAA GATCAGTCCC AGCTTGATGG TGTTTTATAC CCAGGCTTTG
CTCGATTTCA GCCAGCAAAA GCCCGCTGCC GCGCTGGAAT CGCTCCAGCA GGTACTCAGC
AAGTCGCCAG ATCACATGCC CAGCGTGTTG CTCGCGGGCG CAGTCCAGTT TGCTCTCGGA
TCCATGCCCC AGGCGGAACA ACACCTGAAA CATTATCTGG AGAAAGATCC CGGAAACATC
TACGCGCGCA AGCTGCTTGC TTCCGCGCTG TTAAAAAATG GCGAGACGAA ACGCGCGATC
GACATCCTGA CCCCACCGCT CAAAAATGTG AAAGAGGATC CCCAGTTATT CGCCTTGGCT
GGGGAAGCTT ACATGCAAGC CAAGGATTTT GCGAAAGCCA CGGAATACTT TGAAATGGCC
AGCGATATCG CACCGAGAAG TGCGATGCTT CATACCGCTT TAAGCATGAG CAGGTTGGGG
CAGGGGGAAA ACGCCCGGGC CATCTCCGAA CTGGAAACAG CGACAAAACT CGACCCCAAA
TCGCCGCGGG CAGGAGTGTT GCTGGTCATG ACCCATTTAC GCCTCAAAGA ATTCGACAAG
GCGCTCGCCG CGGTAAAGGC ATTGGAGAAG GAGAATCCTG ATAATCCCCT CATTCAAAAC
CTGAAAGGCG GCGTGTATCT TGGCAAGAAC GATATAGCGA ATGCAAGAGC GAGCTTTGAG
AAAGCGCTTG CTATTCAACC AAACTATTTT CCGACTATAG CAAATCTCGC ACGACTCGAC
ATTCAGGATA AAAAACCGGA TGCCGCAAGA AAACGCTTTG AAGCAATTCT GGAAAAGGAC
AAGAAGAATA TCCAGGCCAT GGTCGCGCTG GCGGGTCTTG CTGTCAACCA GGGACAGAAC
CAGAAGGCTA CCGAGTGGCT GGAGCGTGCA ATGCAAGCAA ATCCGGATGT CCTTCAGCCG
GCCATCCTGC TTGGAACACA TTATCTGCGT TTGGGCGAAA AGCAGAAGGC CCTGGCTCTC
GCCAAGAAAC TGGAGGGGAC ACATTCCAAG GACTCCGCTG TGCTGGATTT ACTGGCACAG
GCTCAACTAG CTAACAATGA CAAATCGAGT GCCTTGGAGA GCTACGCCCG GCTCGCTGTC
GTACAACCGG AATCCCCGCT TGTTCAATTC CGCATCGCCT CCATTCATAT GGCAATGCGG
AACCTCTCCG CCGCATCAGA TGCGCTGAAA AAATCGCTGG CTATAAAACC CGATTATCTG
GCTGCGCAAT TGGCGCAGAT CGACATTGAA GCAGAACAGG GTAATTATGA GAAAGCAATC
GCGATGGCGC GCCAAGTTCA GAAGCAGCAT AAAGGGTCAC CCGCAGGTTA TATAGCAGAG
GGCGATCTGT TGCTGAAACA GAAAAAACCC GCGCTTGCCG CAAGCGCCTA TGAACAGGCA
TTCGCGGTTA ATAAAACCTC GCCCTTGTTG ATCAAGCTGC ACGCATCGCT CAGGCAGGCT
GGAAAAGATA AGGAGGCAGA CCTTCGCTTG ATCGAATGGC TGAAAAAACA CCCGAATGAT
TTATCTGCTC GAATGTACCT CGCGGATACG TTTCTCACAG AGGGAAAACT GGCTGCTGCA
GTGGAGCAAT ATCAAACTGT CCTGAAAGAA CAACCGAAAT TTGCTCCGGC GCTTAATAAC
CTGGCTACCG CTTATCAGCG TCAAAAGGAC CCCAGGGCCT TGGAATACGC GGAAAAGGCA
TACCAGCTTG CTACGGAAAA CCCGGCAGTA CTGGATACGC TGGGTTGGGT ACTGCTGGAG
CAGGAAAATA TCGCGCGAGC TCTACCGCTT TTACAGAAAG CCGCTTCTTT GGCGCCTCAA
GCAGGGGAAA TCCGCTACCA TTTTGCATCC GCACTGGTTA AATCTGGCAA TAAAAGCCAG
GCGCGTAAAG AGCTTGAGCA AATTCTGGCT ACTGGAAAAA CCTTTTCAGG CATAGATGAA
GCCAGAGCTC TTCTTGAGCG GATACAGTAG
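
The gene sequence is displayed in space-separated blocks of ten bases, so it must be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and sanity-checked. The helper names are hypothetical, and the start-codon set is an assumption covering only the common bacterial initiators (ATG, GTG, TTG), not every initiator permitted by translation table 11.

```python
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: common initiators only
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts with
    an assumed start codon and ends with a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# The first two display blocks of the gene sequence, used only as a demo.
demo = clean_sequence("ATGCGCGCTT TTAGAAAGAC")
print(len(demo), gc_fraction(demo))
```

Applied to the full sequence, the same helpers could be used to compare the result against the GC content and Gene Length fields of the record.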
 
Protein sequence
MRAFRKTVLK KTGLTLAIAA ALTVSSMQGC NKAVDPVKLM ADAKRYEESG DHKSAIIQLK 
NALQQNPDNS EARYLLGAIY NKTGDFQSAE KELRKALSLG MDAGKVLPEL GQALLRLGAY
QQILDETKEL ADKTKSAQIL TLRGNAQLGL GKTAEAKALF EQALGHNPDS ADALIGLAKY
SLVQRDVEGA THFSEQAVSR SPQNVEAWLF RGDLLRMQGK SGEALAAYDQ VVKLKPNAAI
AYINKAFIEI GTGKFEAAKA DIDAARKISP SLMVFYTQAL LDFSQQKPAA ALESLQQVLS
KSPDHMPSVL LAGAVQFALG SMPQAEQHLK HYLEKDPGNI YARKLLASAL LKNGETKRAI
DILTPPLKNV KEDPQLFALA GEAYMQAKDF AKATEYFEMA SDIAPRSAML HTALSMSRLG
QGENARAISE LETATKLDPK SPRAGVLLVM THLRLKEFDK ALAAVKALEK ENPDNPLIQN
LKGGVYLGKN DIANARASFE KALAIQPNYF PTIANLARLD IQDKKPDAAR KRFEAILEKD
KKNIQAMVAL AGLAVNQGQN QKATEWLERA MQANPDVLQP AILLGTHYLR LGEKQKALAL
AKKLEGTHSK DSAVLDLLAQ AQLANNDKSS ALESYARLAV VQPESPLVQF RIASIHMAMR
NLSAASDALK KSLAIKPDYL AAQLAQIDIE AEQGNYEKAI AMARQVQKQH KGSPAGYIAE
GDLLLKQKKP ALAASAYEQA FAVNKTSPLL IKLHASLRQA GKDKEADLRL IEWLKKHPND
LSARMYLADT FLTEGKLAAA VEQYQTVLKE QPKFAPALNN LATAYQRQKD PRALEYAEKA
YQLATENPAV LDTLGWVLLE QENIARALPL LQKAASLAPQ AGEIRYHFAS ALVKSGNKSQ
ARKELEQILA TGKTFSGIDE ARALLERIQ