Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_11600 |
Symbol | |
ID | 7760102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1113129 |
End bp | 1114889 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643804062 |
Product | intracellular signalling protein with diguanylate cyclase and phosphodiesterase activities |
Protein accession | YP_002798364 |
Protein GI | 226943291 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.631303 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAGC CGCCCCCCTC CCCCCAGCCG TCCGCCGAAC AGTTGGAACT GCTCGCCGTT CTCGAACACG CCATGGCCAT CGCCGAATTC GCGCCGAACG GCCGCGTCCT GCGCGTCAAC GACAAGTACC GGCAGATCTT CGGCTACAGC GAGCAATCCA TCGGCCAGCG CCGTCACCAG CAACTCTGCG CGCCCGACCC GACCAACCGG CACGATTTCG ACGGTCTGTG GGCCGGACTG GAAATGGGCC GCCCGGCGAA CGGCCGCTAC CCCTACGCGA GCGCCGACGG CCGATGTCTG TGGCTGGAGT CGACCTACGT CCCCATTCGC GACGACGCCG GACGCCTGAA GCGCATCGTC CAGATCGCCA TCGACGTCAG CGTCCAGACC GAACGCGAGG AAGTCGCACG GCAGCGCTGC CGCCAACTGA TGCTGGAGCG CGAGGAAAGT CACGACCGGA TTCGCCAACT GGCCTTCTAC GACCCCCTCA CCGACCTGCC CAATCGAGGC CTGCTGCTGA TCCAGGCCGA CCAGGCGATC GCCCGGGCCA GGCGCGAGCG CACGGCCTTG AACGTCCTGT TCTTCGACCT GGACCGTTTC AAGCTGATCA ACGACACCCT CGGCCATCCG GCCGGCGACC TCATGCTGCG CACCATCGCC CAGCGCCTGC GCAGCGAACT GCGGACCACC GACATCGTCG GCCGCCTGGC CGGCGACGAA TTCGTCGTGG TGCTCGCCGA CTGCGATCTC CGGCAGACGA CCGAAACCAT CCGGCGCATC CAGAAGCAGT TGTCGGCATC CTGCCAGATC GCCGGAGCGA CCCTCGCCCC CTCGGCCAGC ATCGGTATCA GCCGCTTTCC CGACGACGGC GAGGACATGG AAACCCTGCT GTACTACGCC GACCTCGCCA TGTATCAGGC CAAAAGCAAA GGACGCGGCC AGTTCAGCTT CTTCAGCGAG GAGATGAACC GTCAGGCCCA GGAGCGCCGG ACGCTCGAGA CGGACCTGCG CGAAGCGCTG CGGCGGCGGC AGTTGCAGCT CCACTACCAG CCGCAGATTG ACCTCAGCAG CGGCCGGCTG TGCGGCATCG AAGCCCTGGC CCGCTGGTTC CACCCGCAAC TCGGCAACAT TCCGCCGAGC CGCTTCGTTC CGCTGGCGGA GGAATGCGGG CTGGCCGGCG AGCTGGATCG CTGGGCGCTG GAGGAAGCCT GCCGGCAACT CGCCGCCTGG CGCGAGGCCG GACTCGAACC GGTGACGGTC TCGGTGAATC TGTCGCCGCT CAGCATCCAC GACACCGAAC TGCCCGCTCG GATCGCCGAC ATCCTGCGCC GCCACGCCCT GGCGCCGGCC GCCTTGAACC TGGAAATCAC CCAGGAGGCG CTGCACGGCG GCAACCCCGG CACGCTGAAG ACCCTGCATG CCGTTCAGGC CATGGGCATC GGCCTGACCG TCGACGACTT CGGTACCGGC CAGTCCTGTC TCGGTTACCT GCGCCACCTG CCGATCCGCG CCCTGAAACT GGACCGCAGC TTCGTCCGCG ACCTGGAGCA CGACGAGGCC ACCCGCGCCC TGACCGAGGT GGCCATGCAC ATCGGCGACA GCCTGCGCAT CGCCGTGTTC GCCGAGGGAG TGGAAAACGA GGAACAACGC CGGCTGCTCA CCAACCGGGG CTATCAGGTG GTCCAGGGCT TCCTGCTCTC GCAGCCGCTG TCCGCCGACC AGTTGTCGGA GTGGCTGGCC AGGCGCTGGC CCGACCGCTA G
|
Protein sequence | MKQPPPSPQP SAEQLELLAV LEHAMAIAEF APNGRVLRVN DKYRQIFGYS EQSIGQRRHQ QLCAPDPTNR HDFDGLWAGL EMGRPANGRY PYASADGRCL WLESTYVPIR DDAGRLKRIV QIAIDVSVQT EREEVARQRC RQLMLEREES HDRIRQLAFY DPLTDLPNRG LLLIQADQAI ARARRERTAL NVLFFDLDRF KLINDTLGHP AGDLMLRTIA QRLRSELRTT DIVGRLAGDE FVVVLADCDL RQTTETIRRI QKQLSASCQI AGATLAPSAS IGISRFPDDG EDMETLLYYA DLAMYQAKSK GRGQFSFFSE EMNRQAQERR TLETDLREAL RRRQLQLHYQ PQIDLSSGRL CGIEALARWF HPQLGNIPPS RFVPLAEECG LAGELDRWAL EEACRQLAAW REAGLEPVTV SVNLSPLSIH DTELPARIAD ILRRHALAPA ALNLEITQEA LHGGNPGTLK TLHAVQAMGI GLTVDDFGTG QSCLGYLRHL PIRALKLDRS FVRDLEHDEA TRALTEVAMH IGDSLRIAVF AEGVENEEQR RLLTNRGYQV VQGFLLSQPL SADQLSEWLA RRWPDR
|
| |