Gene Nmul_A1392 details

Gene Information

Locus tag: Nmul_A1392
Symbol:
ID: 3786422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1583745
End bp: 1585163
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 52%
IMG OID: 637811480
Product: hypothetical protein
Protein accession: YP_412087
Protein GI: 82702521
COG category:
COG ID:
TIGRFAM ID:
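
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record; coordinates assumed 1-based and inclusive.
START_BP = 1583745
END_BP = 1585163


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1419 472
```

Both results match the Gene Length and Protein Length fields of the record.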


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000235247
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGCCA CCATCCAGTT CAGCCAGCCA GACAAGAAAT TCGACATTTT ACAAAAGCTG 
TTCAGCTTCG TTAAAGGCTT CAAAAATCTG CGCCAGCACA TCCTGGAGCA GGGCATATTG
CTGGAAAGGT TGAATTCTGG CGAAATAGAG AATGTGCAGC GAGCGCTTGC AGGTATAAAT
TACCTGGAGG GCAGGGTTAT AGACAACTCG GTGCGTATTT TCGTCACCGA CGGGGAACTG
CGCGCGCTGT TCGATCTGAT GATGCCCGTT TCCCGCAAAC AGAATGATTT TTCCCGGATC
TTGTGGGAAC GTGGTTTTAC GATTGAAGAG CTGTCGCAGG ATCAGGCGGA AAACCTGAGG
AATCAGTTTT CGGCCATCGC CACCGTGACC ATCGGGCCGG ATGTTCCACG GACCAGGATT
TATACTGTTG GCGGGCAGAT TTTTCAGGAA GATGGAGTGC CGCTGTGCGC AAGGGGCTTT
ACCGTTTGCG CATTCGATGC GCTCTCGGTC AATACATTTG TGCGGTGCGG CGCTATGGGC
GCGGTTCAGG AGGATGGATT CTACCGTATC GACTATGCCT GGCGCTCGAA TGGACGAATA
GGCCCTGATC TGTTTGTGCG GGTATTCGAT CCGGAGGGCG ACATTGTAGC TGAAGCAAGG
AAGAATCCGG CTGCGGTTCA GGAATTTCTC GATATTACAG TTAAAACGCT TTGCATCGTT
CGGGGCACAA TTCGCCAAGT GGATGACTTC CCGCTTCCTC ATCTCCTCGT CCGTGCATTT
GATCGGGACA TGCGAGCGGA GACATTGCTG GGTCAGGCAA TGACGGATGC GGAAGGAAGT
TATCAGATTA CATATGGCAC AAACAAGCTC CGGATGAAGG ACAAAGCGGA TCTGATCGTG
CGCGTTTTCG AACCGGCCGA TAGCGAAGGC AAGGAAACAG GAGACGAAAT CGGAGCTTCA
GAAATCATAT TCAATGCTCC GCTACAGCAA GCGGTCGATC TGGAGGTCAA ATCGGGAAAA
TTCCGGGGAC CGTCCGAGTA TGAGCGATAT ATTACGGCCC TGAAGCTGCT CATTGAGGGA
GAGCCTGTTC ACCAATTGAC CGATAAGGAT TTAAGTTTCC TTGGAGGTAA GACAGGCATT
CCGCTGGAAC ACCTGAATTA TCTTCGGCTG GATGATCAAT GGTGTTTTCA TTACAGCATG
GAACCGGCTG TGGTCTATAG TCTATTACGT CAGGGACTTC CTGCCGACCT CCACCACCTG
TCGACTGAAA AACCAACCCG CCTGCATGAG GCGCTGCAGG CCTCCCTGGC GCACAACATC
GCCCCTGCAG TACTTGCCGA TAAGGTTGAT CAGGCCATAA AGCCACTTCT CTCCCTTGCT
GATTCGATGG TCTTTGAGCT TGAAAGAAGG GCAAAATAA
 
Protein sequence
MRATIQFSQP DKKFDILQKL FSFVKGFKNL RQHILEQGIL LERLNSGEIE NVQRALAGIN 
YLEGRVIDNS VRIFVTDGEL RALFDLMMPV SRKQNDFSRI LWERGFTIEE LSQDQAENLR
NQFSAIATVT IGPDVPRTRI YTVGGQIFQE DGVPLCARGF TVCAFDALSV NTFVRCGAMG
AVQEDGFYRI DYAWRSNGRI GPDLFVRVFD PEGDIVAEAR KNPAAVQEFL DITVKTLCIV
RGTIRQVDDF PLPHLLVRAF DRDMRAETLL GQAMTDAEGS YQITYGTNKL RMKDKADLIV
RVFEPADSEG KETGDEIGAS EIIFNAPLQQ AVDLEVKSGK FRGPSEYERY ITALKLLIEG
EPVHQLTDKD LSFLGGKTGI PLEHLNYLRL DDQWCFHYSM EPAVVYSLLR QGLPADLHHL
STEKPTRLHE ALQASLAHNI APAVLADKVD QAIKPLLSLA DSMVFELERR AK
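
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table by hand instead of using a library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). Only the first and last lines of the gene sequence are translated here. The last line begins in frame because the 23 preceding lines hold 23 x 60 = 1380 bases, a multiple of 3.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G; third
# position varies fastest). Table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCGAGCCA CCATCCAGTT CAGCCAGCCA GACAAGAAAT TCGACATTTT ACAAAAGCTG"
last_line = "GATTCGATGG TCTTTGAGCT TGAAAGAAGG GCAAAATAA"

print(translate(first_line))  # MRATIQFSQPDKKFDILQKL
print(translate(last_line))   # DSMVFELERRAK*
```

The two outputs match the first 20 and last 12 residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon.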