Gene Nmul_A1424 details


Gene Information

Locus tag: Nmul_A1424
Symbol:
ID: 3786622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1634947
End bp: 1636146
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 52%
IMG OID: 637811512
Product: hypothetical protein
Protein accession: YP_412119
Protein GI: 82702553
COG category: [S] Function unknown
COG ID: [COG4222] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
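
The coordinate and length fields above are mutually consistent, which the following minimal Python sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1634947
end_bp = 1636146

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# and therefore contributes no residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the reported Gene Length (1200 bp) and Protein Length (399 aa).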


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0154311
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAA TTCTTCTTGC TGCCTTTGTC GTTCTGGAAA TAGGCTTTGC CCCGCCCCTT 
CGCGCCGCCG AATCAGCAGG TTTTGCTCTC AATTATATTG GCCAGCAAAT CGTACCCAAC
AAGACAAGAT TCAAAGGGAC AACGGTCGGG GGGTTGTCAT CTCTGGACTA TAGCGCAAGC
ACTGACCGTT ACCTATCCAT CAGCGACGAT CGCAGCAGAA CCAATCCAGC CCGGTTTTAT
GAATTGTCTC TGGATCTCGC CAAATTCCAG TGCTCGGCCA AGCCGGGTAT GGCAGGCGTG
ACCTTTCAGG CTGTAACCAC GATCCAGCAA GCCGGTGGAG GGGCATTCGA AAAAAACTCC
GTGGATCCGG AAGGTCTCCG TTTTGACGGC AGCCGCAACA AGATTTATTG GAGTGAGGAA
GGCCGCCGGG AGATATCGGG TTTTCGAAGC CCTGCGGTGC GGGAAATGAA TGCTGATGGC
AGACATTCCC GCGATTTCGT TGTTCCTATT TACTACTCTC CCAGTGGCTC CCGTCTTTGG
ACATTTACCG GCAGTAAGGG TGTTTACGAC AATTTGGGAT TTGAGAGTCT GACACTCTCC
ACCGACGGTA CAACCCTGTA TACCGCCACC GAAAACGGCC TGGTTCAGGA TTCTCCCCCT
GCCAATGCCT ATAGAGGCTC ACGCGCACGT ATTCTTGCCT TCGACATTGC CACCGGGAAA
TCAGTCGCGG AATATGCTTA CGATGTTGAA CCTGTTACAT CCGTACCATC CTTGCTCGGC
GGTTTCACCA TCATCGGCGT GAGCGACTTC CTCGCCATCG GCGACCGCCA ATTCATCACT
ATAGAGCGCG CGTTATCCCC CGGCACGATC ACGCCTGGCC GTATTAACAC CGGATATACC
GTCCGGCTTT ATTACGCAGA TGCAAGGGAC GCCACCAACA TTTCCGGAAT GGAATCAATC
GCGGACAAGA ACATCTCTCC GGTAAGAAAA ATTCTCCTGC TTGATATGTC AGACCTGAAA
AATGCGGATG GCTCGGCTCT GGCTATTGGT AACATAGAAG GCATCACCTT TGGTCCCGAA
TTCAGGGGCA AACGCACTAT CTTGCTGGTG GCTGACAACA ATTTCTCCAG AATGCAATTC
ACCCAATTTG TCGCATTGGA AATTGCCTCT GAATCGGAGC TAGTGGAGCG GTTACAATAA
 
Protein sequence
MKIILLAAFV VLEIGFAPPL RAAESAGFAL NYIGQQIVPN KTRFKGTTVG GLSSLDYSAS 
TDRYLSISDD RSRTNPARFY ELSLDLAKFQ CSAKPGMAGV TFQAVTTIQQ AGGGAFEKNS
VDPEGLRFDG SRNKIYWSEE GRREISGFRS PAVREMNADG RHSRDFVVPI YYSPSGSRLW
TFTGSKGVYD NLGFESLTLS TDGTTLYTAT ENGLVQDSPP ANAYRGSRAR ILAFDIATGK
SVAEYAYDVE PVTSVPSLLG GFTIIGVSDF LAIGDRQFIT IERALSPGTI TPGRINTGYT
VRLYYADARD ATNISGMESI ADKNISPVRK ILLLDMSDLK NADGSALAIG NIEGITFGPE
FRGKRTILLV ADNNFSRMQF TQFVALEIAS ESELVERLQ
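
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how the two relate, the Python sketch below strips that whitespace, computes a GC fraction, and translates a nucleotide string codon by codon. It is a sketch under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard set of assignments, which translation table 11 shares (table 11 differs only in which codons may act as start codons). Since this gene begins with ATG, no special start-codon handling is included.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments and
# differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record's block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAAATAA TTCTTCTTGC TGCCTTTGTC GTTCTGGAAA TAGGCTTTGC CCCGCCCCTT"
print(translate(first_line))
```

Applied to the first line of the gene sequence, this yields the first 20 residues of the protein sequence (MKIILLAAFV VLEIGFAPPL). Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.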