Gene Nmul_A2351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2351 
Symbol 
ID3784755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2677720 
End bp2678907 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content62% 
IMG OID637812442 
Producthypothetical protein 
Protein accessionYP_413034 
Protein GI82703468 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.212242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAGAA TCAAGCAGCT TGCATGGGCG ATGGGTTGCG TATCGCTCAT GAGCGGGTGC 
ACGTTGTATC CGTGGTATCA GAACAAGGCC GGGCAGGCTG TTAACGTCAA GCCGGTGATG
GAAGTCCGCC ACGCGATTCG CAGTGCGGAC GCGCTGTATC AGCTTGGACG ATATTACCAG
GCCAAGGTCG ACTATGCGGA AGCCATTGCA GCCTACGAGA AGGCGCTGGA GGCAGACCCT
CGTCATGTCG AGGCGCACAA TGGCCTGGGC GTGGCCCATT GCCTTCTGGA CCGGCATGAA
CTGGCGCTGC AGTATTTCCG GAAAGCGATC GGGATGGCCC CCCTGGCCGC CCATCTGCAC
AATAATCTAG GCTATGCCCA CCTGGTGCAC GGACAGGAAG CTGAAGCCGT GTCAGCGTTC
GAGCGGGCGC TGTTGCTCGA GCCTCACAAC CAGCGGGCGC GACGCCACCT CGCCGCCGTC
TACAAAAAGG CGGGACTGCA CGACAAGGCT GCTGCGCTGA CCGTGGCACC CTCTGGAGCC
CCTGTGGGAG CCACCAAGGC ACCCCCCACG GCACCCATAC CAACGCCTGC TCCCGGCACT
CCTGCCGGTA CTCCCGCTGG TATTCCTTCT GCTGCCATAT CGGCCATATC GCCTCCCATG
GCAGCCGCAC CGGGCGAGAA ACAGAAGTTG TCATGCAGCG CCGCAGCGCG ACTGCTGCAG
GTTACACCCG GTGTGTTCGA GTTCCGGATG GCCGAAACGG AGGCGATGAC AGCCATGCCT
TCGGGTAAAA TCATCGGCAG GACCGCTCCC CCGCAGGATT CGGGCAAGTT TTCCGGCCAG
GACATCCGCA TCGAGGTCTC GAATGGCAAT GGCTTACCCG GCATGGCCAG GCAGGTATCC
GATTTTCTGC AGCAGAACGG GTTCGCCAGG GCACGCCTCA CCGACCGGCA GCCGTATCAG
CAGGCCCTGA CGGAAATACA CTATCGGCCG GGCCATTCCG GAGTGGCCGA GGAGATCAGC
CGGTTGATGC CAGGGGGGAG CGGGGTCCCC ACAGTGGAGA GTTATAATCT CCGCAGGGAC
ATTCATGTGC GGGTGATGCT GGGCAAGGAC GCTGTGCGCC AGGTAGCTCA CCTGGAGAGT
CCGCAAAAAG TGCAGATTGC GCAAGGAACT GCCGGAGCCG TCGAGTAA
 
Protein sequence
MFRIKQLAWA MGCVSLMSGC TLYPWYQNKA GQAVNVKPVM EVRHAIRSAD ALYQLGRYYQ 
AKVDYAEAIA AYEKALEADP RHVEAHNGLG VAHCLLDRHE LALQYFRKAI GMAPLAAHLH
NNLGYAHLVH GQEAEAVSAF ERALLLEPHN QRARRHLAAV YKKAGLHDKA AALTVAPSGA
PVGATKAPPT APIPTPAPGT PAGTPAGIPS AAISAISPPM AAAPGEKQKL SCSAAARLLQ
VTPGVFEFRM AETEAMTAMP SGKIIGRTAP PQDSGKFSGQ DIRIEVSNGN GLPGMARQVS
DFLQQNGFAR ARLTDRQPYQ QALTEIHYRP GHSGVAEEIS RLMPGGSGVP TVESYNLRRD
IHVRVMLGKD AVRQVAHLES PQKVQIAQGT AGAVE