Gene Nmul_A1969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1969 
Symbol 
ID3784993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2263566 
End bp2265311 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content58% 
IMG OID637812058 
Producthypothetical protein 
Protein accessionYP_412656 
Protein GI82703090 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCCC GTGCTTTTAA TTCCCTTATG ATGGGACTGG TATTGGCTGG GACAAGCGTC 
TTTTGCCCGT CCTCAGAAGC GCGTGACTCG GCTGTGGGCG AGCAAAGCCG AACCCCTGCG
GCGCAGGGAT CAAACCACAA GCATTATGAA GCTTCACCCA TGGCGAATCA TCCAGGACCG
GATGGGCAAC TGGCCCCCAG GTTGCAGAAC CTCGGCAATC ATGCGTTTCC AGTCAGCACG
CCAAATAAAA GAGCACAGCT CTTTATCAAT CAGGGCATCA ATCTGGCTTA CGGCTTCAAC
CATGCCGAAG CGCGACGCGC ATTCCGCGAA GCCGCGCGAC TCGATCCCGC GCTGGCAATG
GCATACTGGG GTCAGGCGCT GGTACTGGGC CCCAACATCA ATGCCCCCAT GGATCCCAAC
GATGAGCCCG ATGCCTTGAA GCTGGTGCAA AAGGCGAAAT CGCTGATGGA GTCGATCTCG
GAAAGGGAAC AGGCACTGAT CAGAGCGCTG GAGAAACGTT ATTCAGGCGA TTCAGGGGAT
CGCGCGGCAA ATGACAAGTC TTATGCTGAA GCAATGCGGA CGGTCCATCG GCGTTTTCCC
GCTGACCCTG ATATCGCAAT GCTCTATGTG GAATCAGTGA TGGATCTCCG CCCCTGGGGA
TACTGGATGC GGGACGGCCA TCCTCATGCG GGAACGGCTG AAATCGTGGC GCTGACCGAG
GAAGTATTGC GCCGCCATCC CGCGCATCCC GCCGCATTGC ACATGCAAAT TCATCTAATG
GAGCCCACGA ACACGCCCGA GCGAGCGGAG AAAGCCGCGG ATGTCCTGCT CCCGCTGATG
CCTGCAGCGG GTCACATGAT ACACATGCCG TCGCACATTT ATCAGCGGGT GGGACGGTAT
GGGGATGCGA TAAAAAGCAA CCGATTGGCA ATAGCGGCCG ACGAGGACTA CATCGCTCAA
TGCCAGGCAC AGGGACTGTA CCCGATGGCC TACTACCCGC ATAACATCCA CTTTCTTTCG
TTTTCTGCCA CTGCAAACGG TCAAAGCAGA ATGGCGATCG AATCCGCCCG CAAGACCGCC
AGCAGGATAG ACGATGCCAC GCTGAAGGAA ATGCCCCTGA CTGCCGTGTT CCGCATGACA
CCCTACTGGG CTCTCGCGAG GTTCGGACAC TGGCAGGAGA TACTCGATGA GCCCTCTCCT
CCCGTCACGA ACGCCTTCCT CAAGGGGAGC TGGCATTATG TCCGGGGTCT CGCATTTGTC
GCAACCGGGC GTCTTTCCCA AGCCGAGCAG GAACTGGGAA CCTTGCGCGA GATCATGAAA
GACCCGAGCC TGGACGGTGC GCTTTTCTCC AAGAATACGC CGCGCACCGT GCTGAGGATT
GCTCCGGAAG TACTGGCCGG TGAAATTGAC GCCGCTCGCG GTAAATTCGA TTCGGCCATA
GCGCATCTTG AACGCGCGAT CCGCTACGAG GATGCTCTGG TTTACACGGA ACCCGCTGAG
TGGCACTATC CGCCGCGGCT CGCGCTGGGC GCGATCCTAC TTGAAGCCGG ATATCCCGAT
GAAGCAGAGA CGGTCTACTG GAGCGACCTG CAACGCAATC GCGACAGTGG TTGGACTCTT
TTCGGGCTGC TGCAGGCCCT GCGCGCCCAG AAAAAGGAAG CCGAAGCCGA AGTTATTGAG
GCGCGCTTCA AAAGGGCATG GGAGCAGGCT GACGTAAAGC TGACGGCGTC ACGCATGGGG
CGATAG
 
Protein sequence
MESRAFNSLM MGLVLAGTSV FCPSSEARDS AVGEQSRTPA AQGSNHKHYE ASPMANHPGP 
DGQLAPRLQN LGNHAFPVST PNKRAQLFIN QGINLAYGFN HAEARRAFRE AARLDPALAM
AYWGQALVLG PNINAPMDPN DEPDALKLVQ KAKSLMESIS EREQALIRAL EKRYSGDSGD
RAANDKSYAE AMRTVHRRFP ADPDIAMLYV ESVMDLRPWG YWMRDGHPHA GTAEIVALTE
EVLRRHPAHP AALHMQIHLM EPTNTPERAE KAADVLLPLM PAAGHMIHMP SHIYQRVGRY
GDAIKSNRLA IAADEDYIAQ CQAQGLYPMA YYPHNIHFLS FSATANGQSR MAIESARKTA
SRIDDATLKE MPLTAVFRMT PYWALARFGH WQEILDEPSP PVTNAFLKGS WHYVRGLAFV
ATGRLSQAEQ ELGTLREIMK DPSLDGALFS KNTPRTVLRI APEVLAGEID AARGKFDSAI
AHLERAIRYE DALVYTEPAE WHYPPRLALG AILLEAGYPD EAETVYWSDL QRNRDSGWTL
FGLLQALRAQ KKEAEAEVIE ARFKRAWEQA DVKLTASRMG R