Gene Nmul_A2651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2651 
Symbol 
ID3785262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp3040242 
End bp3041543 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content54% 
IMG OID637812740 
Producthypothetical protein 
Protein accessionYP_413330 
Protein GI82703764 
COG category[S] Function unknown 
COG ID[COG5563] Predicted integral membrane proteins containing uncharacterized repeats 
TIGRFAM ID[TIGR02595] PEP-CTERM putative exosortase interaction domain
[TIGR02913] probable extracellular repeat, HAF family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAACCT TCGTTGATTT CAAAATCCGT GGCTTTATCC TCCTTGCAAC ATTTTTCACC 
GGCTTGGGTT TCGGTACCAG CGCCTTCGCT CAAACACTAC TACGTGAATA TCAATACCTC
GTCGACCTCA GCAGCAGGAC AACAACCCGC CTTTATCAAT CCCCTTTCGG CGACGTTTTT
TATCGTAACA TCAACGATTC GGGGCAGTTA GTGGGCAATT TTGGGGAAGC TCCTTTCCAT
GCTTTCATCA CCGGCCCCAA CGGGATAGGT ATGAGAGACT TAGGCACCCT GGGGGATAAT
CCGGCACGTT CGAATTCATC TGCCTTTGCC ATCAACAACT CAGGGCAGGT GGCAGGATTT
TCTGATTCGA TTCGTGACCG TCTTCAGTTT GAGTCCCATG CTTTCATCAC GGGTCCTGAT
GGGATGGGTA TGAGGAGTCT GGGCACCTTG GCCGGTAATC ACCCCGCTGC TTCCAGCAGT
GCTTCTGGCG TCAATGAGGC TGGCCAGGTA GTTGGTGGCT CGGTTGTTGG TGCCTCTTAT
CATGCTTTCA TCACGGGCCC CGGTGGGATA GGTATGAGGG ACTTAGGCAC CTTGGGTGGT
ACTAACAGCC GTGCTTCTGG CATCAATGAG GCTGGCCAGG TAGTTGGTGG CTCGGTTGTT
GGTGCCTCTT ATCATGCTTT CATCACGGGC CCCGGTGGGA TAGGTATGAG GGACTTAGGC
ACCTTGGGTG GTACTAACAG CCGTGCTTCT GGCATCAACG AGGCCGGGCA GGTGATAGGG
AACTCTCTCA CGGCTCAAAA CGTTTGGCAT GCTTTCATTA CGGGCCCGGA CGGGACGGGT
ATGAAAGACC TGGGCACCCT GGGCGGTACT AGCAGCAGTG CTGTTGGCAT CAGCGATATC
GGGCAGGTGG CGGGGAACGC TGACACGGCT GGAGGTGCCT CTCATGCTTT CGTCACCGGG
GCGGATGGGA TAGGTATGAG GGACTTGGGC ACCTTGGGCG GAACTTCTAG CGAGGCGTAT
GGCATCAACG AGGCCGGGCA AGTAATAGGG GGCTCTCTCA CGGCTGAAAA TGTTTGGCGT
GCTTTCATCA CCGGCCCCGA GGGCGAAGGC ATGACAGACC TCAATTCACT GGTTGATATG
CCGACTGGAG AAGTTCTACT CCAGGCTACT GCTATCAATA ACGCAGGGCA AGTCCTTGCA
ATCGGACTAA TCCCTGAACC GGAAATCTAT GCCTTGATAC TCCCTGGGTT AGGGTTGGTC
GGATTTATAG CGCGGCAAAA GAAGGCGAAG AAGCCTTGTT AA
 
Protein sequence
MKTFVDFKIR GFILLATFFT GLGFGTSAFA QTLLREYQYL VDLSSRTTTR LYQSPFGDVF 
YRNINDSGQL VGNFGEAPFH AFITGPNGIG MRDLGTLGDN PARSNSSAFA INNSGQVAGF
SDSIRDRLQF ESHAFITGPD GMGMRSLGTL AGNHPAASSS ASGVNEAGQV VGGSVVGASY
HAFITGPGGI GMRDLGTLGG TNSRASGINE AGQVVGGSVV GASYHAFITG PGGIGMRDLG
TLGGTNSRAS GINEAGQVIG NSLTAQNVWH AFITGPDGTG MKDLGTLGGT SSSAVGISDI
GQVAGNADTA GGASHAFVTG ADGIGMRDLG TLGGTSSEAY GINEAGQVIG GSLTAENVWR
AFITGPEGEG MTDLNSLVDM PTGEVLLQAT AINNAGQVLA IGLIPEPEIY ALILPGLGLV
GFIARQKKAK KPC