Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5844 |
Symbol | |
ID | 7380625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 863410 |
End bp | 864651 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643649375 |
Product | hypothetical protein |
Protein accession | YP_002547612 |
Protein GI | 222106821 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1748] Saccharopine dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGA ACGTGCTCAT TATCGGCGCT GGCGGTGTCG CCCAGGTCGT TGCGCATAAA TGCGCCCAGA ATAACGACGT GCTGGGCGAT ATCCATATCG CCTCGCGCAC CAAGGGCAAA TGCGACGCCA TCATCGCCTC CGTGCATGAA AAGAACGCCA TGAAGCAGCC GGGCGTGCTG GAAGGCCATG CGCTTGACGC CCTGGATATC GACGCCACCA AGGCGCTGAT TGAAAAGACC GGCTCGCAGA TCGTCATCAA TGTCGGCACC GCTTTCCTCA ACATGTCGGT GCTGCGCGCA TGCATGGATA CCGGCGTTGC CTATATCGAC ACGGCGATCC ACGAAGAGCC GAACAAGATC TGTGAGACGC CGCCATGGTA TGGAAACTAC GAATGGAAGC GGGCCGCGGA ATGTGAAGAG AAGGGCATTA CCGCCATTCT CGGCGCTGGT TTCGACCCCG GCGTTGTCAA TGCCTATGCC CGCCTTGCCA AGGATGAATA TCTCGACAAG GTCACGGATG TCGATATCGT CGATATCAAC GCTGGCAGCC ATGGCAAATA TTTTGCCACC AACTTCGACC CCGAAATCAA CTTCCGCGAA TTCACGGGCG TTGTCTATTC GTGGCAGAAG GGCGAATGGC AGGTCAACAA GATGTTCGAG ATCGGCAAGG ATTACGACCT GCCGGTGGTC GGCACGCGCC GCGCCTATCT CTGCGGTCAT GACGAAGTGC ATTCGCTGGC CAAGAACATG GACGGTGCCG ATGTGCGCTT CTGGATGGGC TTCGGCGATC ACTATATCAA TGTCTTCACC GTGCTGAAGA GCATTGGTCT GCTTTCCGAA CAGCCGGTCA AGTTGGCCGA TGGCTCGGAC GTCGTGCCGC TGAAAGTGGT CAAGGCCTGC CTGCCCGACC CGTCATCGCT GGCACCTGAC TATGAAGGCA AGACCTGCAT TGGCGATTAC GTGAAGGGGC TGAAGGACGG CAAGGAAAAG ACCGTCTTCA TCTACAATGT CGCCGACCAT AAGGAAGCCT ATAACGAAGT CGGCTCCCAG GGCATTTCCT ACACGGCTGG CGTTCCGCCG GTGGCGGCTG CCATGCTGGT GGCGACCGGC GAATGGGACG TGAAGAAAAT GGCCAATGTC GAGGAACTGC CGCCACGGCC TTTCCTCAAC ATCCTCAACC ATATCGGCCT GCCGACCCGC ATCAAGGATG AAGACGGCGA CCGGGCGTTG GATTTTTCGT AA
|
Protein sequence | MKKNVLIIGA GGVAQVVAHK CAQNNDVLGD IHIASRTKGK CDAIIASVHE KNAMKQPGVL EGHALDALDI DATKALIEKT GSQIVINVGT AFLNMSVLRA CMDTGVAYID TAIHEEPNKI CETPPWYGNY EWKRAAECEE KGITAILGAG FDPGVVNAYA RLAKDEYLDK VTDVDIVDIN AGSHGKYFAT NFDPEINFRE FTGVVYSWQK GEWQVNKMFE IGKDYDLPVV GTRRAYLCGH DEVHSLAKNM DGADVRFWMG FGDHYINVFT VLKSIGLLSE QPVKLADGSD VVPLKVVKAC LPDPSSLAPD YEGKTCIGDY VKGLKDGKEK TVFIYNVADH KEAYNEVGSQ GISYTAGVPP VAAAMLVATG EWDVKKMANV EELPPRPFLN ILNHIGLPTR IKDEDGDRAL DFS
|
| |