Gene Information |
Locus tag | Avi_5141 |
Symbol | |
ID | 7380944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 130508 |
End bp | 132733 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643648797 |
Product | hypothetical protein |
Protein accession | YP_002547034 |
Protein GI | 222106243 |
COG category | |
COG ID | |
TIGRFAM ID | |
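The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
start_bp = 130508
end_bp = 132733
gene_length_bp = 2226
protein_length_aa = 741


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acid count, assuming the CDS ends in one untranslated stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions, 132733 - 130508 + 1 gives 2226 bp, and 2226 / 3 - 1 gives 741 aa, matching the Gene Length and Protein Length fields.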
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATA ATGAACTCTG GTACGACCGT GCCGCATCTG TCTGGACCGA GGCCCTTCCC GTTGGCAATG GTCGGTTGGG GGCTATGGTG TTCGGAGATG CCTGGAATGA ACGCCTTCAG ATCAATGAGA GCACATTCTG GAGCGGTGGC CCTTACCAGC CGATCAATCC CGATGCCCGC GCTGCCTTGC CTGAGGTGCG TAATCTGATC CTTGCCGAAC GGTATCAGGA GGCAGACCGC AAGGCCTATG AAGGGGCCAT GGCCAAGCCG GATCGGCAGA CCTCCTATCA GCCGATTGGC GATGTCTGGC TTGATTTGCA CCATGATATG ACCGTCACCA ACTATCGCCG CTCGCTTGAT CTGGAAACCG CTGTTGCGGT GACACAGTAT GATTGCCATG GCGTGCATTT TCGCCGCGAT GTGTTTGCCT CCGCCATTCA GGACGTGATC GTCTGCAAGA TTTCGGTGGA TCAGCCCGGT GCCTTGTCGA TGACGGTGAT GCTGAGCAGC CCGCAAAATG GCGATCCGAT TGATATCGCC GATGCCACCC TTGGCTATGA CGGGCGTAAT CGTCGGCAAA ATGGTATTGA TAGCGCGTTG CGCTTCGCTT TCCGGGTGCG GGTGCTGGCC GAGGGGGGCT TTGTCGATAT TGGCGAGGAA ACCATTCGGG TGCGTGAAGC GTCCAGCGTC ATGTTGCTGA TCGATGCAGG GACCAGCTTT CAAAACTACA GGACTGTTGA TGGCGATCCA CAGGCGCAGA TCAAGGCGCG CCTGGATGCG GCGGCTATGC TGTCCTATGA GGCGCTGCTG GAAGCCCATG TCACAGAGCA TCGCCGTCTA TTTAACCGGA TGCAAATTGC CCTCGGTGAC AAGCCCGTGC CAACGCTTCC CACCGACAAG CGCGTTGCCG CCTATGCCGA AGGTGATGAT CCCTCACTGG CCGCGCTCTA TTTGCAGTAT GGCCGCTACC TTGCCATTTC CTGTTCCAGA CCCGGCACGC AGGCGGCCAA TCTCCAGGGC ATTTGGAATG AAGATATTCT GCCAGCTTGG GGCAGCAAAT ACACCGTTAA CATCAATCTG GAGATGAATT ACTGGTTGGC CGATGTTGCC AATCTCTCCG AGACCTTCCT GCCGCTGGTG GAACTGGTGG AGGATGTGGC CGAAACCGGC CGCGAGATGG CCAAGGCCCA TTACGGCGCA CGCGGCTGGG TTCTGCACCA TAACACCGAT ATCTGGCGTG CCACTGGCCC CATTGACGGC CCTCATTGGG GTCTCTGGCC GATGGGCGGT GCCTGGCTCT GCGCCCAGCT TTATGATCAC TATCGCTTCA ATCCCGATCG CGCTGTGCTG GAGCGCATCT ATCCTTTGAT CAAGGGGGCG GTGGAATTTG CGCTGGATAC GCTGGTTGCC TTGCCTGATA GCAATTACCT CGGCACTTGC CCATCACTGT CGCCGGAAAA CTCCCATCCT TTTGGTTCTT CACTCTGCGC CGCACCGGCC ATGGACAATC AGATTCTCCG CGATCTGTTC GAGGCCTTCG CGGATGCCAG CGCCACGCTT GGCCGGGACG GCGAGCTTCG CACAGAGGCT GCCGCCACCC GCGCTCGTTT GCCGGAAGAC CGTATCGGCA AAGGCGGTCA GTTGCAGGAG TGGATGGACG ACTGGGATCT GGACGCGCCA GAGCAGCAGC ATCGCCATGT CTCGCATCTC TATGGGCTTT ATCCGAGCCT GCAAATCGAC CCATTGGAAA CGCCTGAAAT GGCTAAGGCC GCACAGGTTG TTCTGGAGCG GCGCGGCGAT 
GATGCAACGG GCTGGGGCAT TGGATGGCGG CTGAACCTTT GGGCCAGACT GGGCAATGGC AATCGGGCCG CAGAGGTTCT GGTCAAGCTT TTGACACCGG AGCGCACCTA TCCAAACCTG ATGGACGCCC ATCCGCCTTT CCAGATCGAC GGGAATTTCG GAGGAGCGGC GGGGATTGTG GAAATGCTGG TGCAATCGCG CCCCGGCGAG CTTCGCCTCC TGCCCGCCTT GCCGGAACAA TGGTCAAGCG GCAGCCTGAA GGGCGTCCGC ATCCGTGGCG GTCACACGGT TGATCTCAGC TGGCAAGCAG GAAAACTGAC TTCACTCCGC ATCACGGCTG GCCACTCCGG GCCACTCACC ATCCGGCAAC CTGCTGGTGT CCTTGAGGTT CAACTTAGGG AAGGCGAGGT TTGGGAAGGT CGATAG
|
Protein sequence | MSDNELWYDR AASVWTEALP VGNGRLGAMV FGDAWNERLQ INESTFWSGG PYQPINPDAR AALPEVRNLI LAERYQEADR KAYEGAMAKP DRQTSYQPIG DVWLDLHHDM TVTNYRRSLD LETAVAVTQY DCHGVHFRRD VFASAIQDVI VCKISVDQPG ALSMTVMLSS PQNGDPIDIA DATLGYDGRN RRQNGIDSAL RFAFRVRVLA EGGFVDIGEE TIRVREASSV MLLIDAGTSF QNYRTVDGDP QAQIKARLDA AAMLSYEALL EAHVTEHRRL FNRMQIALGD KPVPTLPTDK RVAAYAEGDD PSLAALYLQY GRYLAISCSR PGTQAANLQG IWNEDILPAW GSKYTVNINL EMNYWLADVA NLSETFLPLV ELVEDVAETG REMAKAHYGA RGWVLHHNTD IWRATGPIDG PHWGLWPMGG AWLCAQLYDH YRFNPDRAVL ERIYPLIKGA VEFALDTLVA LPDSNYLGTC PSLSPENSHP FGSSLCAAPA MDNQILRDLF EAFADASATL GRDGELRTEA AATRARLPED RIGKGGQLQE WMDDWDLDAP EQQHRHVSHL YGLYPSLQID PLETPEMAKA AQVVLERRGD DATGWGIGWR LNLWARLGNG NRAAEVLVKL LTPERTYPNL MDAHPPFQID GNFGGAAGIV EMLVQSRPGE LRLLPALPEQ WSSGSLKGVR IRGGHTVDLS WQAGKLTSLR ITAGHSGPLT IRQPAGVLEV QLREGEVWEG R