Gene Avi_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_1968 
Symbol 
ID7387254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp1623281 
End bp1624588 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content61% 
IMG OID643651217 
Producthypothetical protein 
Protein accessionYP_002549413 
Protein GI222148456 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCCA AGAAAGGCAA GGCCGAGCGG CGTTCAACGT CGCTCGAAAG CGAAACCGTG 
CCGGTCAGCG CCACCAATTT TATGGAGTTC TTCGGGCTTG GCGGCGGCGC ACTGCCCACG
GTCAATATCG AAACGGCCCT GAAAGTCCCG GCTGTGCAGG CGGCGGTATC GTTTCTGTCG
CGGACCCTCG CAACACTTCC GCTGCATGTC TACCGGACGG GCGAACGCGG ACCGGTGCGG
CTCGGCGGAA AGCTGGCTGT CGTTCTGGAG GAAAACCCGA ATGACGAAAT GGACACCTCA
AAGTTCCGGC GCTTCTTCTG GGAACAGGTG TTTACCGGCG GGCGGGGGCT GGCCTGGATC
GAGCGCAAGG GTGCTGGCAT CGAGGCGCTT TGGCCGATTG ATCCCGGCAG CTGTTCGATC
AGGCGGCGTG GCGGGCGGCT GTTCTACAGT TTCGAGGGCA GGGAATATCC AGCCACCGAC
GTGATCGACA TCCCCTATAT GCTGAAGCGT AATCTGGTCC AGCATCGCGG CCCGATTGCC
ATGGCGGAAA AGGCCATCCA ACTGGCGCTG GCCATGAACG ACTATGCCTC GAACTTCTTT
GCAGGCGGCG GCGTTCCGCC CTTGGCGCTG GAAGGGCCGA TGCCTGCCAA TGACAAGGCC
ATGCAGCGGG CGCGTGAAGA TATCAAGCGG GCGGTGAAGG CGGCGCGAGA CGATCAGCTT
CCGCTGATCC AGTTGCCGGT CGGCTACAAA CTCACCCAAG TGGGTTACGA CCCGGCCAAG
GGGCAGATGA CCGAGGCGCG GCTTTACCAG GTGCAAGAGA TTGCCCGCGC CTATCAGATC
CCGCCGAACT TCCTTCAGGA CTTGAGCCGA GCAACCTTCT CCAATGTCGA GCAGAACGAC
CTCTATCTGG TCAAGCATCT GGTCAGCCAA TGGGCGACGG CGATGGAAGG GGAAATGAAC
CTGAAGATTT TTGGGCGGAT GAATACCCGC CGTTATGTCC GCCACAACCT CGACGGCCTG
ATGCGCGGTG ACTTCAAGAG CCGGTTGGAA GCCTTAGCAA CCGGCGTCAA TTCGGCGCTG
CTGACCCCGA ACGAGGGCCG AGAGATTGAA GGCCGTCCAC GTGATCCGAA CCCGGCTGCC
GACCAACTCT ACATCCAGGG CGCAACCGTC GCCATCGGCA CCAGTGTCAT CGACACAAGT
GCCCCCGGCA CAAATGCGAT GGGCGAGAAT AGCGACCCGC CGCTCAATGA TCCCGCAGCA
GACACAGAAA GGCAGGTGAC CGATGACACC GAAACCGAAA CCGGGTGA
 
Protein sequence
MASKKGKAER RSTSLESETV PVSATNFMEF FGLGGGALPT VNIETALKVP AVQAAVSFLS 
RTLATLPLHV YRTGERGPVR LGGKLAVVLE ENPNDEMDTS KFRRFFWEQV FTGGRGLAWI
ERKGAGIEAL WPIDPGSCSI RRRGGRLFYS FEGREYPATD VIDIPYMLKR NLVQHRGPIA
MAEKAIQLAL AMNDYASNFF AGGGVPPLAL EGPMPANDKA MQRAREDIKR AVKAARDDQL
PLIQLPVGYK LTQVGYDPAK GQMTEARLYQ VQEIARAYQI PPNFLQDLSR ATFSNVEQND
LYLVKHLVSQ WATAMEGEMN LKIFGRMNTR RYVRHNLDGL MRGDFKSRLE ALATGVNSAL
LTPNEGREIE GRPRDPNPAA DQLYIQGATV AIGTSVIDTS APGTNAMGEN SDPPLNDPAA
DTERQVTDDT ETETG