Gene Nmul_A2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2039 
Symbol 
ID3786729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2333764 
End bp2335062 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content52% 
IMG OID637812127 
Productdiguanylate cyclase 
Protein accessionYP_412725 
Protein GI82703159 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000319282 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGCGC CCTCACGCAA TTTATTACTC TGCAATAAAG GAAAGACCGA ATTGAACCAT 
TCGTTCTCCA TGCTGCAAGC CATCGTAGAA TCGACGCCCT ATGGGCTTCT GGTGACCAAC
AAGCACGGGC ACTTGCTTTG CTATAACCAG CCCTATATGG ATATGTGGCG CATTCCCCCT
GAGGTCATGG TCAATGCGAG ACACCAGATA ATTTTCGGGC ATTACTCCAG TCAATTAAGA
GATCCACAAC AATTCCTTCA CTCAACCGAG GTAATTTATG GCACGTGGTT GCCCGAAAGT
TTCGACATCC TCGAATTCCT CGATGGGCGG GCATTCGAAC GCCATACAAA AGCCACGACC
TTGGAAGGGC CAAACATGGT GCGTGTTTGG AGCTTCAAGG ACATCACTGA ACGCAGGCAG
GCGGAGGCCT ATAAAGCGCA GTTGGCCGCA ATGGTCGAGT CCTCGGACGA TGCAATCATT
GTCAAGGACC TGAACGCCAT TATCACAAGC TGGAATGCCG GGGCGGAACG GATTTTCGGG
TACCAGGCAA GCGAAATAAC AGGCTCTTCC ATACTGGCCT TAATTCCCCC GGAGCGTCAT
GAAGAAGAGA AGGGGATCAT GAGCCTCGTC AAGAGCGGGA ACCGCGTGGA CCATTTTGAA
ACCATGTGGT GGGGAAAGGG CAAGAAACCG ATTGATGTCT CGGTCACGAC ATCGGCAGTG
AAGGATAGTG ACGGCAATAT TGTGGGCATA TCACAGATTG CGCGGGATAT CACCAAGCGC
AAGGAATCAC AGAAACGCAT CGAGTATCTT GCCCATTACG ATCCGCTAAC CGGGCTGCCC
AATCGCGCGT TGCTCGCAGA CAGAATGAAA ACTGCCATCG AGAATGCCAA GCGTTATTCC
TTCCAGTTGG CAGTCCTGTT TGTGGACCTT GACCACTTCA AGCTAATCAA TGATTCGCTT
GGTCATGAAA TCGGTGACAA GCTGCTCAAG ATCGTAGCCG AGCGCATGCG ATCCAGGCTG
CGTCAAACCG ATACCGTCAG CCGGCTGGGA GGTGACGAAT TTATTATCCT GCTGAGCCGG
ATTAATGCAG CATCCGATGC CGCTTGCGTC GCCGAGAAAA TTATTGCAGC ACTATCCCAG
CCCTACCACC TTGAACAGCA TGAGTTGGGG CTGGGCGCGA GCATCGGGAT CAGCATTTAT
CCGGACAGCG GCAAGGATAC CAGCAGCTTG TTGCGCAGTG CCGATGAAGC GATGTACTCT
GCCAAAGGAC AGGGTAGGAA CCGCTATCAT GGTCCGTAG
 
Protein sequence
MIAPSRNLLL CNKGKTELNH SFSMLQAIVE STPYGLLVTN KHGHLLCYNQ PYMDMWRIPP 
EVMVNARHQI IFGHYSSQLR DPQQFLHSTE VIYGTWLPES FDILEFLDGR AFERHTKATT
LEGPNMVRVW SFKDITERRQ AEAYKAQLAA MVESSDDAII VKDLNAIITS WNAGAERIFG
YQASEITGSS ILALIPPERH EEEKGIMSLV KSGNRVDHFE TMWWGKGKKP IDVSVTTSAV
KDSDGNIVGI SQIARDITKR KESQKRIEYL AHYDPLTGLP NRALLADRMK TAIENAKRYS
FQLAVLFVDL DHFKLINDSL GHEIGDKLLK IVAERMRSRL RQTDTVSRLG GDEFIILLSR
INAASDAACV AEKIIAALSQ PYHLEQHELG LGASIGISIY PDSGKDTSSL LRSADEAMYS
AKGQGRNRYH GP