Gene Nmul_A1143 details

Gene Information

Locus tag: Nmul_A1143
Symbol:
ID: 3784256
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1313896
End bp: 1315143
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 53%
IMG OID: 637811228
Product: cupin region
Protein accession: YP_411838
Protein GI: 82702272
COG category: [S] Function unknown
COG ID: [COG2850] Uncharacterized conserved protein
TIGRFAM ID:
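
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 1313896
END_BP = 1315143
GENE_LENGTH_BP = 1248
PROTEIN_LENGTH_AA = 415


def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds(gene_length_bp: int) -> int:
    """Amino-acid count, ASSUMING the CDS ends with exactly one stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
    print(residues_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions the three figures agree: 1315143 - 1313896 + 1 = 1248 bp, and 1248 / 3 - 1 = 415 aa.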


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000830013
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGCAC CGTTGCCCCG CTGTATGTCG TGCAGCCATT GTTACCTTCA CCAAAACGCT 
CACACCGGTA ATCCCATGAC GAAAATCCGG CTTTTAGGCG GCCTCTCGCC CAGCGACTTC
CTTCAGGATC ATTGGCAGAA AAAACCTTTG CTGATACGCA AAGCCTTGCC GGATTTCAGC
GGACTGCTGG ATGCCAATGA GCTTATCGAC CTGGCCTGTC AGGAAGATGC GCAATCGCGT
CTGGTTACCC GTAGAAACGG CCGATGGGAG GTGAGGCATG GCCCCTTCGC ACCTCGCGCT
TTCGCACGGC TGCCGCAGAA AGGCTGGACT CTGCTGGTGC AGGACGTCAA TCACTTCCTT
CCGGCGGCGC GTGAACTGCT GCTGAAATTC AACTTCATTC CACATTCCCG GCTCGATGAT
CTGATGGTCA GCTACGCTCC CGAAGACGGG GGCGTGGGGC CCCACTTTGA CTCCTACGAC
GTTTTTCTGC TGCAAGGAAC AGGCCGCAGA CGCTGGCGAA TATCGGGCCA GAAGGACAGA
ACGCTGGTGG CCGCCGCACC GCTCAAGATT CTGCAGGATT TCAGGCCGGA GCAGGAATGG
GTACTGGAAC CAGGCGACAT GCTGTATTTG CCGCCCGGCT ATGCGCACGA TGGAGTTGCG
GTGGAACCCT GCATGACCTA TTCCATCGGT TTTCGCGCAC CCACCTATCA GGAGCTCGCG
ATGCAGTTTC TCGTTCATCT CCAGGACAGC TGTGAAATAG CGGGTATCTA CGAGGATCCG
GATCTCAGGA TTCAAACTCA TCCCGGACAA ATCAGTTCCG CGATGCTGGA TCAGGTCAAC
GCGGCGCTCG ACAAAATCGA GTGGGACAAC GTTGAAGTGG AACGTTTTAT CGGTATGTAT
TTAACCGAAC CCAAACCTCA CGTTTTTTTT ATGCCTCCTC AGGAGCCGAT ATCCGAACGG
AAATTCGTGC ATCAGATAAG AAAAGGAAAA CTGCAACTGG ATCTGAAAAG CCGCATGCTC
TTCAGGGAAA ACAGAATTTT CCTGAACGGA GACGTATATG AAGTAGGAAA AACCGCACAA
CGGATACTAG GAGAGCTGGC CGATCGTCTT GCCTTATCCC CTGTGAGAGA TATCGATGCC
GAAACACAGG CGCTGCTATA TCAGTGGTAC CTCGATGGTT ACGTCGTTTA TGTTGAAGAT
ACCGGGGCAG TTGAGGAAAT CCAGGAATCG ATAATAGAAA GAAAGTAA
 
Protein sequence
MIAPLPRCMS CSHCYLHQNA HTGNPMTKIR LLGGLSPSDF LQDHWQKKPL LIRKALPDFS 
GLLDANELID LACQEDAQSR LVTRRNGRWE VRHGPFAPRA FARLPQKGWT LLVQDVNHFL
PAARELLLKF NFIPHSRLDD LMVSYAPEDG GVGPHFDSYD VFLLQGTGRR RWRISGQKDR
TLVAAAPLKI LQDFRPEQEW VLEPGDMLYL PPGYAHDGVA VEPCMTYSIG FRAPTYQELA
MQFLVHLQDS CEIAGIYEDP DLRIQTHPGQ ISSAMLDQVN AALDKIEWDN VEVERFIGMY
LTEPKPHVFF MPPQEPISER KFVHQIRKGK LQLDLKSRML FRENRIFLNG DVYEVGKTAQ
RILGELADRL ALSPVRDIDA ETQALLYQWY LDGYVVYVED TGAVEEIQES IIERK
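
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is an illustration only, not part of the record: it shows one way to strip the block spacing, compute a GC fraction, and check that a nucleotide string looks like a complete coding sequence. The helper names are hypothetical, the start-codon set is a simplified assumption for translation table 11, and the toy input is made up; paste the full gene sequence in its place to compare against the listed length and GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(
    seq: str,
    start_codons=("ATG", "GTG", "TTG"),  # ASSUMED simplified start set
    stop_codons=("TAA", "TAG", "TGA"),
) -> bool:
    """True if length is a multiple of 3 and the ends are start/stop codons."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in start_codons
        and seq[-3:] in stop_codons
    )


if __name__ == "__main__":
    # Toy input in the same block layout; NOT the gene from this record.
    toy = clean_sequence("ATGAAATTTG GGCCCTAA")
    print(len(toy), round(gc_fraction(toy), 3), looks_like_complete_cds(toy))
```

The gene sequence as listed begins with ATG and ends with TAA, so it should pass the same start and stop check once the blocks are joined.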