Gene Nmul_A0987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0987 
Symbol 
ID3786587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1144740 
End bp1145816 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content55% 
IMG OID637811070 
Producthypothetical protein 
Protein accessionYP_411682 
Protein GI82702116 
COG category[S] Function unknown 
COG ID[COG5563] Predicted integral membrane proteins containing uncharacterized repeats 
TIGRFAM ID[TIGR02913] probable extracellular repeat, HAF family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.779307 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTAC TCAGCCATTC CTTCAAGGTT CGCCATCTTA TCCTGGCCGC AGCCTTAACT 
ACCGGCCTTG GTTTTGTCAA TCCTGCAAAC GCCGAAATAG TCCTCCTTGT TGACCTTAAC
AGCAGGACAG CAATTAGTCT GGGCACTTTG GGCGGGAACT GGAGCAATGC CTACGGCATC
AACGATGCTG GGCAGGTGGC TGGATACTCT CACACGGCTG AAGGTGGTCA GCATGCCTTC
ATCACCGGTA CTGATGGGGT GGAGATGAGA GACTTGGGCA CCTTGCGGGG GGGTGAGAGC
TATGCGCTCG ACATCAACGA TGCCGGACAG GTAGTGGGAG GCTCTGGCAC TGCTGGAGGC
TATGTCCATG CTTTCATCAC TGGCCCGAAT GGCACGGGGA TGAGAGACCT GGGCACTTTA
GGCGGGCGCT GGAGCTATGC TTTCGGCATC AACGATGCCA GACAGGTGGC TGGATACTCT
CTCACGGCTG ATAGTAATCG TCATGCCTTC ATCACCGGTT ATGATGGCAT GGGGATGAGA
GACCTGGGCA CTTTGGGCGG GAGCTTGAGC GAGGCTTCCG GCATCAACGA TGCCGGACAG
GTGGTAGGAA TGTCTGGCAC AGTTGATGGT AATCTTCATG CCTTCATCAC CGGCCCTGAT
GGGGTGGGGA TGAGAGACCT GGGCACTTTG GGGGGGCGCT GGAGCTATGC CTACGGTATC
AACGATGCCG GACAAGTGGT TGGAAACTCT TCCACGGCTG AAGGTAGTCT CCATGCCTTT
ATCACCGGTC CCGATGGGGT GGGGATGAGA GACCTAGGCA CTTTATTAGG TGGGAACGGA
AGCCAGGCCA ACGGCATCAA CGACATCGGA CAGGTAGTGG GATACTCTTA CACGGCTGAA
GGTTATTACC ATGCCTTCAT CACCGGTCCT GATGGTGAAG GAATGACGGA CCTCAATTCG
TTGGTTGACC TCCCTCAAGG CATGGTTCTA GTCAAGGCAA TGGATATCAA TAACAGGGGT
CAAGTCATTG CTATTGCTAT TCCTACTACT ATCCCGAACC TGAAGCCTAT GCCTTGA
 
Protein sequence
MTLLSHSFKV RHLILAAALT TGLGFVNPAN AEIVLLVDLN SRTAISLGTL GGNWSNAYGI 
NDAGQVAGYS HTAEGGQHAF ITGTDGVEMR DLGTLRGGES YALDINDAGQ VVGGSGTAGG
YVHAFITGPN GTGMRDLGTL GGRWSYAFGI NDARQVAGYS LTADSNRHAF ITGYDGMGMR
DLGTLGGSLS EASGINDAGQ VVGMSGTVDG NLHAFITGPD GVGMRDLGTL GGRWSYAYGI
NDAGQVVGNS STAEGSLHAF ITGPDGVGMR DLGTLLGGNG SQANGINDIG QVVGYSYTAE
GYYHAFITGP DGEGMTDLNS LVDLPQGMVL VKAMDINNRG QVIAIAIPTT IPNLKPMP