Gene Avi_1072 details

Gene Information

Locus tag: Avi_1072
Symbol:
ID: 7387153
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 903491
End bp: 906448
Gene Length: 2958 bp
Protein Length: 985 aa
Translation table: 11
GC content: 61%
IMG OID: 643650561
Product: hypothetical protein
Protein accession: YP_002548769
Protein GI: 222147812
COG category: [T] Signal transduction mechanisms
COG ID: [COG2200] FOG: EAL domain
TIGRFAM ID: [TIGR00229] PAS domain S-box; [TIGR00254] diguanylate cyclase (GGDEF) domain
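
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(903491, 906448)
print(length)                     # 2958
print(protein_length_aa(length))  # 985
```

Both values match the Gene Length and Protein Length fields of this record.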


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAATATAT TCAGATACTC ATCCGAGGCC AATTCCCGGC GATTTTTTGA GCAGAAACAG 
ACCCTATCCA GCCCGGGCCT GTTGCGGCTT GCCGCTGGCC TTCTGGTGCT GGTCCTGACA
CTGACCGGGT TTGGCGCGGA CATCAGCCAT GCCGCCGAAC CGGTGAAGAT CTCCCGGGAC
GACACGGCGC TCGACCTGAC CGCCACCACG GAAATCCATC TCAATCAGGG TGAGGCATTC
CAGGTCTCCA CTGCCGCCGG TTCCGATGGC ATCCGCCGCC GCATCGAAGT GCGTGCCTCA
TCAGCGCAGC ACCAGGGCGA CTGGGCGGTG TTCGCGCTCG CCAATGTGTC CGACGAGCAG
TTGGAGCGGG TCATCGTCGC GCCGCATTAT CGTCTCGTCA ATTCCAAGGT GTTCTGGCCG
GATCTCGGCT CGCAGCGGAT CATTTCCATC ACCCCCAGCG AGGGATTTTC GCTGGATCGG
GTGCCGAGCG ACGATGCCGA CGTGTTTCGC ATCACCCTCA ATCCGGGCGC GGTGGTGACG
TTCGTGGCCG AACTGGCAAC GCCGAACCTG CCGCAGATCT ACCTGTGGGA GCCGGATGCC
TATAAGGATA CGATCAACTC GTTCACGCTT TATCGCGGCA TCGTGCTGGG CATTTCCGGC
CTGCTGGCGG TGTTTCTGAC CATTCTGTTC GTGGTGCGCG GCACATCCAT GCTGCCCGCA
GCGGCCTCAC TGGCCTGGGC AGTGCTCGCC TATATCTGCG TCGATTTCGG TTTCCTTGGC
AAGCTGATCA GCATCACCAC CCATAGCGAG CAGATCTGGC GAGCCGGGGC CGAGGTGGGT
CTGGCCTCAG GCTTCGTGAT TTTTCTCTTT ACCTATCTCA ACCTCAATCG CTGGCATGTG
AACCTCGGCT ACGCGACCTT TGCCTGGATC TTGGGCCTGG TGCTGCTGTT TGGCGTGGCA
ATCTACGATC CGTCGATTGC CGCAGGCATT GCCCGACTGT CCTTTGCGCT GACGGCCACG
GCGGGGATCG CGCTGATCGT CTATCTCGGC TTGAACCGTT ATGACCGCGC CATCCTGCTG
GTGCCGACCT GGACGCTGAT CATCGTCTGG CTGATTGCCG GCTGGATGAC GGTGACGGGA
CGGCTGTCGA ACGATATCAT CCAGCCTGCT CATGGCGGCG CGCTGGTGCT GATCGTGCTG
CTGATCGGCT TCACCGTCAT GCAGCACGCC TTTGCTGGCG GTGGCTATAA CCAGGGCCTG
TTTTCCGATC TGGAACGCCA ATCCCTGGCA CTGACTGGGG CTGGCGACAT TGTTTGGGAC
TGGGATGTGG CCCGCGACCG AGTGGTGACC ATTCCCGATA TTTCCACCCG GCTCGGCCTC
AGCCACGGCG CGCTGCACGG GGCTGCCCGC AACTGGATTC CCAGCCTGCA CCCCGATGAC
CGCGACCGGT TTCGCGCCAC ACTGGATGTG CTGCTGGAAC ACAAGCGCGG GCGGCTGAAC
CACGAATTCC GCATCCGCGC CGATGACGGG CATTTCCACT GGCTGCATAT CCGTGCCCGG
CCCGTATTGG GAGCCAACGG CGAAATCATC CGCTGCATCG GTACGATTGC CGATGTCACC
GAACAGAAAA ACACCGTCGA CCGGCTGTTG CAGGATGCGA TCAACGACAA TCTCACCGGC
TTGCCGAACC GGGAAGTGTT TCTGGACCGT TTGCAGACGC TTTTGACCGT GACCGCCACC
GCCGATACGG TGCGCCCGAC GGTCATGATC GTTGACATCG ACAATTTCGG TCGGGTCAAC
GACATGCTCG GTATTTCCGC CGGAGACAAT ATCCTGATCG CGCTGACCCG GCGGCTGCGG
CGGCTATTGA AGCCGCAGGA TACGCTGGCC CGGCTTTCGG GAGACCAGTT CGGCCTGATC
CTGATTTCCG AACGCGATCC GGCCAAGGTC GCCGATTTCG CCGATGCGAT GAACAAGGCG
ATCATGGTGC CGATCAATTT CGCCAATCGG GAGATCAATC TGACCGCCTC CATCGGCCTT
GTCTCCTGGG TGGACCAGCA GGAAAGCGCT GCCGGGCTGT TGTCGGATGC CGAGCTTGCC
ATGTACCGCG CCAAGCGCGG TGGCGGCAAC AAGGTGGAAC CCTTCCGCCC GGCCTTCCGG
GGCACCGTCT CGGAAAAGCT GCAGTTGGAA ACCGAACTTG CCCGCGCGGT CGAGCGCGGC
GAATTGACCA TGGTCTATCA GCCGATCGTG CGGCTGGATG ACGAGGAGCT GGCCGGTTTC
GAAGCGCTGA TGCGCTGGGA ACACCCCAAG CGCGGCACGA TCTCCCCCTC CGAATTCATA
CCCCTGGCGG AAAGCTGCGA CCTGATCATG CCGCTCGGCA TGTTCGCGCT GGAGCGGGCG
GCAACGGATC TGGTGGAATG GGAAAAACAG ACAGGCGAAA TGCCGATTTT CGTCTCCGTC
AACCTGTCCA GCGCCCAGTT GATCAACAAC ACGCTCTATA CCGACATCCG CAGTCTGCTC
AGCCGAGTCA ATTGCAATCC GGCCCGGCTG AAACTGGAGC TGACCGAATC TGTCGTCGTC
GAGAATCCCG AACAGGCAAG GCTGGTGCTG GAAAAGCTGA AGGATATCGG GCTCAGTTTG
GCTCTGGACG ATTTCGGCAC CGGCTATTCC TCGCTGTCCT ATCTGACCCG CTTTCCCTTC
GATACGCTGA AACTCGACAG GGAACTGGTG ACCGACACAA GCGAGCGGCG CAATATCCTG
CTGCGCTCGG TGATCGGAAT GGCCAAGGAC ATGGGCATGG ATGTCGTGGC GGAAGGCATT
GCCAGCGAGG ACGATGGGGA CGAACTGGCG CAGATGGGCT GCCACTATGG CCAGAGCTTC
CTCTACGGCG CTCCGGTCGG CCCGGAAGCG GTCATGCGTC TTTTGAAAGA CCAGCAACAG
CGAGCAAAGC GGGCTTGA
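
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any calculation. As a minimal sketch, the GC content field could be recomputed along the following lines. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is used here for brevity, so the result is not the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence as printed above (not the full gene).
first_row = "ATGAATATAT TCAGATACTC ATCCGAGGCC AATTCCCGGC GATTTTTTGA GCAGAAACAG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_percent(seq):.1f}% GC in the first 60 bases")
```

Passing the complete gene sequence to `gc_percent` would give the value to compare against the GC content field.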
 
Protein sequence
MNIFRYSSEA NSRRFFEQKQ TLSSPGLLRL AAGLLVLVLT LTGFGADISH AAEPVKISRD 
DTALDLTATT EIHLNQGEAF QVSTAAGSDG IRRRIEVRAS SAQHQGDWAV FALANVSDEQ
LERVIVAPHY RLVNSKVFWP DLGSQRIISI TPSEGFSLDR VPSDDADVFR ITLNPGAVVT
FVAELATPNL PQIYLWEPDA YKDTINSFTL YRGIVLGISG LLAVFLTILF VVRGTSMLPA
AASLAWAVLA YICVDFGFLG KLISITTHSE QIWRAGAEVG LASGFVIFLF TYLNLNRWHV
NLGYATFAWI LGLVLLFGVA IYDPSIAAGI ARLSFALTAT AGIALIVYLG LNRYDRAILL
VPTWTLIIVW LIAGWMTVTG RLSNDIIQPA HGGALVLIVL LIGFTVMQHA FAGGGYNQGL
FSDLERQSLA LTGAGDIVWD WDVARDRVVT IPDISTRLGL SHGALHGAAR NWIPSLHPDD
RDRFRATLDV LLEHKRGRLN HEFRIRADDG HFHWLHIRAR PVLGANGEII RCIGTIADVT
EQKNTVDRLL QDAINDNLTG LPNREVFLDR LQTLLTVTAT ADTVRPTVMI VDIDNFGRVN
DMLGISAGDN ILIALTRRLR RLLKPQDTLA RLSGDQFGLI LISERDPAKV ADFADAMNKA
IMVPINFANR EINLTASIGL VSWVDQQESA AGLLSDAELA MYRAKRGGGN KVEPFRPAFR
GTVSEKLQLE TELARAVERG ELTMVYQPIV RLDDEELAGF EALMRWEHPK RGTISPSEFI
PLAESCDLIM PLGMFALERA ATDLVEWEKQ TGEMPIFVSV NLSSAQLINN TLYTDIRSLL
SRVNCNPARL KLELTESVVV ENPEQARLVL EKLKDIGLSL ALDDFGTGYS SLSYLTRFPF
DTLKLDRELV TDTSERRNIL LRSVIGMAKD MGMDVVAEGI ASEDDGDELA QMGCHYGQSF
LYGAPVGPEA VMRLLKDQQQ RAKRA
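
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that correspondence on the first and last rows of the gene sequence only. It uses a plain-Python codon table built from the standard amino-acid assignments, which table 11 shares with the standard code; the alternative start codons that table 11 permits are not handled, which is acceptable here only because this gene starts with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first_row = "ATGAATATAT TCAGATACTC ATCCGAGGCC AATTCCCGGC GATTTTTTGA GCAGAAACAG"
last_row = "CGAGCAAAGC GGGCTTGA"
print(translate_cds(first_row))  # MNIFRYSSEANSRRFFEQKQ
print(translate_cds(last_row))   # RAKRA*
```

The two outputs agree with the first 20 and last 5 residues of the protein sequence above, with the trailing `*` standing for the stop codon TGA.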