Gene Nmul_A0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0104 
Symbol 
ID3786371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp109727 
End bp111196 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content59% 
IMG OID637810174 
ProductSel1 repeat-containing protein 
Protein accessionYP_410805 
Protein GI82701239 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCTAT CCAAATCAGT GGCAACATCC GGGTTCGTAA CGGCTGCCCC GGTATTGATG 
TTGTTCGCCT CCCTGGCTAT CGCGGGGGAT TTCGAGGATG GGATGAAGTT CGTCCTCAGC
AAGGACTATA CCAAGGCAAT GCAATCTTTC CGGAAAGCGG CCAATGCAGG AAATGCTGAC
GCCCAGTTCA ATCTGGGCGT GCTGTATTCA CGGGGCCGCG GCGTGCCACA GGATCATGAG
CAAGCCGCCA AGTGGTATCG CAGGGCGGCG GAGCAAGGGG ACGCACCGGC ACAATCCATG
CTGGGGTATA TGTATCTGAA AGGCCAGGGC GTCCCGCAGG ATTATCAACA GGCAATGTTC
TGGTATTTCC GAGCAGCCGA CAGCGGATAT GCGGTGGCGC AATACAATCT CGGGGTAATG
TATGCAAAAG GCCAGGGCGT GGAAAAGGAT TATCGGCACG CCCTCTCCTG GTATCTGAAA
GCTGCGGAGC AGGGACACGC ACCTGCGCAG GCAATCATGG GATTCATGTA TCTCAAGGGG
CAGGGGGTCG AGCAGGATGA CCATCAGGCT GTATCCTGGT ATCGCAAGGC AGCCGAGCAA
GGGTATGGCG AAGCGCAATA TGCTCTTGGC GTGCTCTACG CCAAGGGCCG GGGAGTAGCG
CAGAGCAACC AGGAAGCCGC CTCCTGGTAC CGCAAGGCTG CTGAGCAGGG GAACACGGAT
GCACAGTTCA ATCTCGGCAT GATGTTCGCC ACGGGAGAAG GAGTCACGCA GGATTATCGG
CAGGCAGCGT CCTTGTATCG CCAGGCGGCC GATCAGGGAT ATGCGCGGGC CCAGTTCAAA
CTCGGGGTGG CAAATGCCAA AGGGCTCGGT ATTCCGGAGG ACGCTTACGA AGCAGCGGCA
TGGTACCGCA AGGCGGCCGA GCAGGGCTAT GCTCCTGCCC AGTTCAATCT GGGCGTGATG
TATGCGACGG GTAAAGGCGT CATTAGGGAT GAGCGGCAGG CGGTATCATG GTATCGACAG
GCGGCCGAGC AAGGAGACCC GGATGCGCAA TATAACCTGG GGGTAAGGTA TGACACGGGA
CGGGGCATCG AAAAGGATCC ACAACAGGCA GTAGCCTGGT ATCGCAAGGC GGCAGAGCAA
GGCTATGCAC GGGCACAATA CAGCGTGGGC GTGAAGTATG ACAGCGGGCA GGGAGTGCCG
CAAGATTACG CGCAGGCGCT AGCCTGGTAC CTGAAGGCCG CGGAGCAGGG GCATGCGGGC
GCCCAGACCA ATCTCGGCGT GCTGTATTAC AACGGCAATG GCGTGAAGCA GGATTATGTG
GAAGCCGACA AGTGGTTCAG CATCGCCAGC GCCGGCGGCT ACGAGGATGC CAAAGAGAAT
CGCGAACTGA TGGAAAAGCT GATGACACCG ATGCAAATCG CCGATGCGCG ACGGGAGGCG
GATGAATGGG CAAGAGCACA CCAACGGTAA
 
Protein sequence
MHLSKSVATS GFVTAAPVLM LFASLAIAGD FEDGMKFVLS KDYTKAMQSF RKAANAGNAD 
AQFNLGVLYS RGRGVPQDHE QAAKWYRRAA EQGDAPAQSM LGYMYLKGQG VPQDYQQAMF
WYFRAADSGY AVAQYNLGVM YAKGQGVEKD YRHALSWYLK AAEQGHAPAQ AIMGFMYLKG
QGVEQDDHQA VSWYRKAAEQ GYGEAQYALG VLYAKGRGVA QSNQEAASWY RKAAEQGNTD
AQFNLGMMFA TGEGVTQDYR QAASLYRQAA DQGYARAQFK LGVANAKGLG IPEDAYEAAA
WYRKAAEQGY APAQFNLGVM YATGKGVIRD ERQAVSWYRQ AAEQGDPDAQ YNLGVRYDTG
RGIEKDPQQA VAWYRKAAEQ GYARAQYSVG VKYDSGQGVP QDYAQALAWY LKAAEQGHAG
AQTNLGVLYY NGNGVKQDYV EADKWFSIAS AGGYEDAKEN RELMEKLMTP MQIADARREA
DEWARAHQR