Gene Avi_5072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5072 
Symbol 
ID7381226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp63703 
End bp65487 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content56% 
IMG OID643648740 
Producthypothetical protein 
Protein accessionYP_002546977 
Protein GI222106186 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAAACA ATACGAGTGC GACTTCGATC ATCTGCGTAT CATCTCATGA CGCTGCGCCA 
AGTGTGGGCA CGGCAGACAC AGAATTGGCA AGCCCGACAC CGGGACAGAT ACGCGACGCG
CTGGATCAAC TCTGCAACGG CAATACATTC GCAACGGCAA AGCGCTCCCG ACAGTTGCTA
CAGTATCTTG TCGAGGAGGC GCTCGCGGGT CGCGCGAATG CCATTGGCGA ACATGCGATC
GCTCAGGACG TATTCGGCAA AGATGAGCAT TTCGATCCGC GGATCGATAC ATGTGTCCGG
ACCGAAGCCT GGCGTCTTCG CAACAGATTG CAAAGCTATT ACGAACATGA GGGACGCTTC
GATGTCGTGA GGGTGGCGTT CATGCCCCGC TCACTCGTGC CAATCTTCTC CTCGCAGCCA
TCCTTGCCCG CGGCGGATAT AAAAGATTTT CCGCGCCGGA TTGGCATTCA TATGGACTGC
GAACCCGATG CCGGCGGGAA CGAACAGCAC TTGGCGCGAC GGCTACCCGA AGAGATTGCC
AGCAGCCTTT TCAAGCTTTC TTCGATAGTC CCGGTTCTTC CTGGCGGAGC GAGTAGATCA
GAGCTGGAGT TACGGTGCGC CATTCGCTCC GATGAGCAAT GGATCAGGAT CGTGACGACA
ATGGTCGATC TGGATGGCTT GGTTCAAGGA AGTAAAACCT TTTCTTTCTC ACGCAATACC
GCGAAACCCT CTCCCACCTT GATAGCCAAT GCCATCGGCG AAATGGTTTC AGATTCGCTG
TCACGCAACA TCCTGGGTCA GGCGGTGAAC ATACCATGCC ATTCTGACGA GAACGAGAAC
CGGTTTCTTG ACCAGCTTCT GCGCGGTTCC CACATCAACC GGCGTGAGCG GTTGATCGAG
CTGCGCGCCG CCGTCTTGCG CTATGAACAA ATCATCTCCG ACGACCCCTT GGATCAATTG
TCTCACCGGA GGCTTATTGG CGCTTTAGGA CAGTTTTTAT CGCTTGCGCC GGGCTCGATA
TCCAGGGTGA TGCCCAAGCT GGCCGCCTCT GCCCACAGCG CTTTGTCGCT GAATGGCAAC
CTCAGCGACG TCTGGCTGAT CTTGGGTTGG GCGTCGAGCT ACGCCTATGA CTGGCCACAG
ACCGAAGGCG CATGCCGCCA GGCCATTGCC ATCACCCCGC TCGACCCTTC GCCCTACATA
TTGCTTGCGT TGACCTATCT GCAAAGTGGA CAGATCGTTT CTGCGCTGCA AATAGCGGAG
GAAGCGATCA ATCTCGATCC TTATTCCCCC ATGGTGGCCA ATGTTTATGC GCTAACGCTG
AATGCAGCCC GCCGTTTCAG GGAGGCAGCG AAAGTCGCAC GCGACGCCCT TGATGCCGAA
CCCGGATTTG TTAAGTTACG GCTTACCTAT GGAGAAGCCA AGCTCAACAT GGGTCAGATT
GATAGCGCTA TCGAAGAATT TACTGCCGCC TCACGCATCA TGACCGAAGA TGCCACGGCC
TGTGGGCTGC TAGGACTAGC CTATGGGCTT TCAGGCGAAA GATCAGAGGC GGATCGTCTT
CTTTCCAGGG TGAAGCAGTC ACCAAATCTG CGAGGCCAAG CTGTGCATGC GGAGGCGATG
ATCCATCTCG GACTAGGCGC TCGCGATGAA GCCATTAGCG CTTTGGAGCT GGCGGTGACA
CGAAGAGGCA CACCAGGGCT GTTTCTGGCA AATGCCGTTT TCGATCCCAT TCGGGATGAC
AGCCGCTTTT CAAGAATTCA ACATCAGATG GAGCTGGCAC ATTAG
 
Protein sequence
MENNTSATSI ICVSSHDAAP SVGTADTELA SPTPGQIRDA LDQLCNGNTF ATAKRSRQLL 
QYLVEEALAG RANAIGEHAI AQDVFGKDEH FDPRIDTCVR TEAWRLRNRL QSYYEHEGRF
DVVRVAFMPR SLVPIFSSQP SLPAADIKDF PRRIGIHMDC EPDAGGNEQH LARRLPEEIA
SSLFKLSSIV PVLPGGASRS ELELRCAIRS DEQWIRIVTT MVDLDGLVQG SKTFSFSRNT
AKPSPTLIAN AIGEMVSDSL SRNILGQAVN IPCHSDENEN RFLDQLLRGS HINRRERLIE
LRAAVLRYEQ IISDDPLDQL SHRRLIGALG QFLSLAPGSI SRVMPKLAAS AHSALSLNGN
LSDVWLILGW ASSYAYDWPQ TEGACRQAIA ITPLDPSPYI LLALTYLQSG QIVSALQIAE
EAINLDPYSP MVANVYALTL NAARRFREAA KVARDALDAE PGFVKLRLTY GEAKLNMGQI
DSAIEEFTAA SRIMTEDATA CGLLGLAYGL SGERSEADRL LSRVKQSPNL RGQAVHAEAM
IHLGLGARDE AISALELAVT RRGTPGLFLA NAVFDPIRDD SRFSRIQHQM ELAH