Gene Information |
Locus tag | Avi_1474 |
Symbol | |
ID | 7386445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 1236163 |
End bp | 1238154 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643650853 |
Product | hypothetical protein |
Protein accession | YP_002549058 |
Protein GI | 222148101 |
COG category | [S] Function unknown |
COG ID | [COG1652] Uncharacterized protein containing LysM domain |
TIGRFAM ID | |
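
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1236163, 1238154)
print(length)                  # 1992
print(protein_length(length))  # 663
```

Both values match the Gene Length and Protein Length rows of the record.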
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.234073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACC GTGCCGGTTG GCTGGCTCTC GGCGTGCTTG CCATTGCGAC AGTATTGATG GTGTTTTTCG TGCTTCCGCA AATTGGCGGC ACAAAGAAGA CGGCAGAAAC ACCTGCTGCC CCGGCTGAAC AGGCGGCAGC GCCCGAAGCG TCCGGGCCTT CCTCACCCAC GCCTGCCCCC GATGGCCAGG CCGCCGCCAA GATGCAGCGT CTGACCAAGG CCGCAGAGCA ATCGGTGGCC GCTTTGGAAA ATCTGTTTGC CGATCAGAAA ACCCCGGCAC CCGAACTTTA CGCCACAGCC CGTATTGCTG CATTGACAGC CCTGAAAGCC CTATCGGATG CCGATCTTCC CGCCGGGATC GAAGCAGGCC TTGTAGACAG CCTCACCAAG GCAAAGGCCT CGGCTGCCCA TGCCTTGCAG TTGATCGCCA AACTGCCCGC CTCCCCGCAG GATGCCGCCG CCATGATCGC CAATATCGGT CGGGCCATGC GCGGTGAACC GGAAGTGGCT ATCGATCCAG ACATGCCGCG TTTTGATGTG CTGCGCGTTG AAAAAGACGG CTCGACTGTG ATCGCCGGCA GCGCGGCACC GGGTGCCAAG GTCGAGGTCC ATGACGGCTC CAGCAATATT GCCAGCGCCA CCGCCTCTCC AAACGGTGAT TTCGCCATCG TACTCGACAA GCCTCTGTCG CCCGGCGATC ATTCTCTGGA TTTGAAAGCC ACGACCAAGG AAGGCAAGAC CATCGGGTCT GAGGAGCAAG CCACGGTTTC CGTCCCGGCC GATCCATCCG GTGACGTGCT GGCCATGGTG ACCAAGCCCG GCGAAGCCAG CCGGCTGATC AGTGGCCCGC AACAGGATCA AGTCTCGCCA AAGGATCAGA GCCAGCCACA GGCTCCTGAA CAGATCAATG GCGTCGACAA GCAGGGCCGC GTCGCCGCAG CCACGCCTTC TACACCGATG ACATCCCCCG GGTCTGCGTC CAGCCCATCC CTGTCCGCTG CCGATTTGCA GATTACCGCG GTCGAGCTTG AAGGCGACAA GATTTTCGTG GCAGGCAACG CCCAGAAGGG CCGCACCGTG ACCGCCTATG CCGATGGCAA GCAGATCGGC TCGGCGGATG TCGACCAGAA GGGGCATTTC GTGGTCGAAG GCCAGATGCC GCTTTCCATC GGCCAGCATA TCATCAGCGT CGATCTGAAG GATGCCAATG GCAGGGTCAC ACTGCGCGCG TCGGTGCCGT TCAACCGGCC GGAAGGCGAT CAGGTTGCGG TTGTCGCACC GGAAGCCCAG CCCGGCGGCG CCGGCAGCAC CGTCAGCCCG GCCACCGTCG ATAGCAACCT GTTTGATCGG CAGCGCGATA CACTGGCCAA GTCCTTCACG CTGCTTAGCA ACCTGTTTGC CGATGGCAAG ACACCGACGC TGGAAAGCCT GGCGGCATCG CGTTCGGCGC TGGAATTTGC CCTGAAAGCA GTTGCGGATT TCCGCCCGAC CCCCAGCACG GACCAGACCG CCAGCGCCTT CATGGCGCGC ATGGCAGACC AGGCTGACAA GGCACTCGTG GTCCTCAAGC AAGTGCCGAC GGCGTCGGTC ACGGCGATGG CCAATGCCTT GCCGACCCTC AAGTCGCTTG TCGATGCGGC GCTGGAACCC ATGCCGGCCA AAATTGCCGC TGCGCAAGCC GCAGCGACGC AAAACCCGGC ACCGACCTTG GCGCCAGCCG ATGGTACATC GCCGCCGGTG CTGAGCCAGG CGCCGCTGAC CGAAAGCAAG AATGCGGTGA TCATCCGCCG GGGCGATACG 
CTCTGGCAGA TTTCCCGGCG AGTCTATGGC CAGGGCGTGC GTTACACGAC GATCTATGTC GCCAATGCCG AACAGATCAG CAATCCCGAC CTGATCGAGC CGGGCCAGAC CTTCACCGTG CCCGATCAAT CCATGCCGGA TGACGAAGCC GAAAAAATCC ATCGTAAATG GATGCTGGAG CACAAGCGAT AG
|
Protein sequence | MKNRAGWLAL GVLAIATVLM VFFVLPQIGG TKKTAETPAA PAEQAAAPEA SGPSSPTPAP DGQAAAKMQR LTKAAEQSVA ALENLFADQK TPAPELYATA RIAALTALKA LSDADLPAGI EAGLVDSLTK AKASAAHALQ LIAKLPASPQ DAAAMIANIG RAMRGEPEVA IDPDMPRFDV LRVEKDGSTV IAGSAAPGAK VEVHDGSSNI ASATASPNGD FAIVLDKPLS PGDHSLDLKA TTKEGKTIGS EEQATVSVPA DPSGDVLAMV TKPGEASRLI SGPQQDQVSP KDQSQPQAPE QINGVDKQGR VAAATPSTPM TSPGSASSPS LSAADLQITA VELEGDKIFV AGNAQKGRTV TAYADGKQIG SADVDQKGHF VVEGQMPLSI GQHIISVDLK DANGRVTLRA SVPFNRPEGD QVAVVAPEAQ PGGAGSTVSP ATVDSNLFDR QRDTLAKSFT LLSNLFADGK TPTLESLAAS RSALEFALKA VADFRPTPST DQTASAFMAR MADQADKALV VLKQVPTASV TAMANALPTL KSLVDAALEP MPAKIAAAQA AATQNPAPTL APADGTSPPV LSQAPLTESK NAVIIRRGDT LWQISRRVYG QGVRYTTIYV ANAEQISNPD LIEPGQTFTV PDQSMPDDEA EKIHRKWMLE HKR
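
The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the gene sequence is given in coding orientation (it starts with ATG even though the gene lies on the minus strand) and that the space-separated blocks of ten are purely cosmetic. The helper names are hypothetical. The codon table uses the amino acid assignments of translation table 11, which are the same as those of the standard code. Table 11's alternative start codons are not handled here.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGAAAAACC GTGCCGGTTG GCTGGCTCTC"))  # MKNRAGWLAL
```

Passing the full gene sequence to `gc_percent` and `translate` should allow the GC content and protein sequence rows to be cross-checked against the record.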