Gene Avin_11600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_11600 
Symbol 
ID7760102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1113129 
End bp1114889 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content68% 
IMG OID643804062 
Productintracellular signalling protein with diguanylate cyclase and phosphodiesterase activities 
Protein accessionYP_002798364 
Protein GI226943291 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.631303 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAGC CGCCCCCCTC CCCCCAGCCG TCCGCCGAAC AGTTGGAACT GCTCGCCGTT 
CTCGAACACG CCATGGCCAT CGCCGAATTC GCGCCGAACG GCCGCGTCCT GCGCGTCAAC
GACAAGTACC GGCAGATCTT CGGCTACAGC GAGCAATCCA TCGGCCAGCG CCGTCACCAG
CAACTCTGCG CGCCCGACCC GACCAACCGG CACGATTTCG ACGGTCTGTG GGCCGGACTG
GAAATGGGCC GCCCGGCGAA CGGCCGCTAC CCCTACGCGA GCGCCGACGG CCGATGTCTG
TGGCTGGAGT CGACCTACGT CCCCATTCGC GACGACGCCG GACGCCTGAA GCGCATCGTC
CAGATCGCCA TCGACGTCAG CGTCCAGACC GAACGCGAGG AAGTCGCACG GCAGCGCTGC
CGCCAACTGA TGCTGGAGCG CGAGGAAAGT CACGACCGGA TTCGCCAACT GGCCTTCTAC
GACCCCCTCA CCGACCTGCC CAATCGAGGC CTGCTGCTGA TCCAGGCCGA CCAGGCGATC
GCCCGGGCCA GGCGCGAGCG CACGGCCTTG AACGTCCTGT TCTTCGACCT GGACCGTTTC
AAGCTGATCA ACGACACCCT CGGCCATCCG GCCGGCGACC TCATGCTGCG CACCATCGCC
CAGCGCCTGC GCAGCGAACT GCGGACCACC GACATCGTCG GCCGCCTGGC CGGCGACGAA
TTCGTCGTGG TGCTCGCCGA CTGCGATCTC CGGCAGACGA CCGAAACCAT CCGGCGCATC
CAGAAGCAGT TGTCGGCATC CTGCCAGATC GCCGGAGCGA CCCTCGCCCC CTCGGCCAGC
ATCGGTATCA GCCGCTTTCC CGACGACGGC GAGGACATGG AAACCCTGCT GTACTACGCC
GACCTCGCCA TGTATCAGGC CAAAAGCAAA GGACGCGGCC AGTTCAGCTT CTTCAGCGAG
GAGATGAACC GTCAGGCCCA GGAGCGCCGG ACGCTCGAGA CGGACCTGCG CGAAGCGCTG
CGGCGGCGGC AGTTGCAGCT CCACTACCAG CCGCAGATTG ACCTCAGCAG CGGCCGGCTG
TGCGGCATCG AAGCCCTGGC CCGCTGGTTC CACCCGCAAC TCGGCAACAT TCCGCCGAGC
CGCTTCGTTC CGCTGGCGGA GGAATGCGGG CTGGCCGGCG AGCTGGATCG CTGGGCGCTG
GAGGAAGCCT GCCGGCAACT CGCCGCCTGG CGCGAGGCCG GACTCGAACC GGTGACGGTC
TCGGTGAATC TGTCGCCGCT CAGCATCCAC GACACCGAAC TGCCCGCTCG GATCGCCGAC
ATCCTGCGCC GCCACGCCCT GGCGCCGGCC GCCTTGAACC TGGAAATCAC CCAGGAGGCG
CTGCACGGCG GCAACCCCGG CACGCTGAAG ACCCTGCATG CCGTTCAGGC CATGGGCATC
GGCCTGACCG TCGACGACTT CGGTACCGGC CAGTCCTGTC TCGGTTACCT GCGCCACCTG
CCGATCCGCG CCCTGAAACT GGACCGCAGC TTCGTCCGCG ACCTGGAGCA CGACGAGGCC
ACCCGCGCCC TGACCGAGGT GGCCATGCAC ATCGGCGACA GCCTGCGCAT CGCCGTGTTC
GCCGAGGGAG TGGAAAACGA GGAACAACGC CGGCTGCTCA CCAACCGGGG CTATCAGGTG
GTCCAGGGCT TCCTGCTCTC GCAGCCGCTG TCCGCCGACC AGTTGTCGGA GTGGCTGGCC
AGGCGCTGGC CCGACCGCTA G
 
Protein sequence
MKQPPPSPQP SAEQLELLAV LEHAMAIAEF APNGRVLRVN DKYRQIFGYS EQSIGQRRHQ 
QLCAPDPTNR HDFDGLWAGL EMGRPANGRY PYASADGRCL WLESTYVPIR DDAGRLKRIV
QIAIDVSVQT EREEVARQRC RQLMLEREES HDRIRQLAFY DPLTDLPNRG LLLIQADQAI
ARARRERTAL NVLFFDLDRF KLINDTLGHP AGDLMLRTIA QRLRSELRTT DIVGRLAGDE
FVVVLADCDL RQTTETIRRI QKQLSASCQI AGATLAPSAS IGISRFPDDG EDMETLLYYA
DLAMYQAKSK GRGQFSFFSE EMNRQAQERR TLETDLREAL RRRQLQLHYQ PQIDLSSGRL
CGIEALARWF HPQLGNIPPS RFVPLAEECG LAGELDRWAL EEACRQLAAW REAGLEPVTV
SVNLSPLSIH DTELPARIAD ILRRHALAPA ALNLEITQEA LHGGNPGTLK TLHAVQAMGI
GLTVDDFGTG QSCLGYLRHL PIRALKLDRS FVRDLEHDEA TRALTEVAMH IGDSLRIAVF
AEGVENEEQR RLLTNRGYQV VQGFLLSQPL SADQLSEWLA RRWPDR