Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5431 |
Symbol | |
ID | 7381527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 430113 |
End bp | 431513 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643649034 |
Product | chlorohydrolase |
Protein accession | YP_002547271 |
Protein GI | 222106480 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.225763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATGA CTTCGAGCGC AATGATCGTC ACCGCCGACA CTCTGTTGAC CATGGATGCG AAAAACAGCG TCATCTCTGA TGGCGCGGTT GCTATTGAGG ACGGTCGTAT CCTGGCGGTC GGTTCGCTGG AGGTGGTGAA GGCCAGCCAT CCGGCTCTGG CGATCAAGAG GATCGACAAT GCCTTGCTGA TGCCGGGGCT GATCAACGCC CATGCCCATT CCGGCTTCCT GCGCGGCACA GCGGAACATC TGCCGGTCTG GGACTGGCTG ACCATCCATA TAAACCCGAT GCACCGCGTG CTTCTGCCGC ATGAAGCCGA GGCGGCGTCC TTTCTGTGCT ACGTCGAATC CGCACTATCC GGCACGACGA CTGTTGTCGA TATGTGGCGC TATATGGATG GCAGTGCGCG GGCGGCACAG TCCATCGGCA CGCGCCTGGT TGCTGTTCCG TATGTTGGTG AGCATCCCGA TTACAATTAT TTCGAAACGC TCGACAACAA TGAGGCGATG ATTGAGACCT GGCATCGCAA GGCGGGGGGC CGCATCAATG TCTGGGTCGG GCTGGAACAT CTGTTTTATG CCGATGCGGC TGGCCAGCAG CGGGCCATCG CCATGGCCAA ACAGTATAAC ACCGGCTTTC ACACCCATTG TTCGGAAGCC GAAGTCGAGG TTGGCGGCTT TATCGACACC TATGGCAAGC GCCCCATGCA TGTTCTGGAG GATCTCGGCT TCTTCGAGGC TCCACGCACC ATGCTGGCCC ATGCCGTCTG GCTGGATGAG GCGGAAATCG AGCTGATTGC CAGATACAAT GTCTCGGTCG CCCATAATCC GGTGTCGAAT ATGAAGCTTG CTTCCGGCAT TGCACCGATT GCCGATATGC TGGCGGCGGG CATTCCGGTC GGTCTTGGCA CGGATGGCGA AAAGGAAAAC AACAATTTCG ACATGTTCGA GGAGATGAAG ACCGCCTCCC TGCTCGGCAA ACTGCGCCAC CGCGATGCCG CCGCCATGGA TAGCTGGCAA TGTCTGCGCA TGGCGACCAT TCTCGGCGCG AGAGCTATCG GTCTTGAGGA TGAAATCGGC TCTATCGAAG TTGGAAAGCG CGCCGATATC ATTGCGGTGC GCACCGATAC GCCGCGTATG ACGCCACTGT TTGCCGACGG TCCCTATTTC AATGTGCAGC ACAATCTCGT CCACGCGGTG CGCGGCGGCG ATGTCGCCAT GACCATGGTG GATGGTCAGG TGATCGTGGA AGATGGCGTG CTGAAAACCG GCGACGTCAA GGCGATCATT GCCGATATCC ATGGCATGGC GCCTTCGCAT TTTGCCCGCC GCGCCGCATG GCTTGCCGAA AACGGCGGCG GCACCAAGCA ATGGATCAGC GAAGCGGAGG GTGTCAAATG A
|
Protein sequence | MSMTSSAMIV TADTLLTMDA KNSVISDGAV AIEDGRILAV GSLEVVKASH PALAIKRIDN ALLMPGLINA HAHSGFLRGT AEHLPVWDWL TIHINPMHRV LLPHEAEAAS FLCYVESALS GTTTVVDMWR YMDGSARAAQ SIGTRLVAVP YVGEHPDYNY FETLDNNEAM IETWHRKAGG RINVWVGLEH LFYADAAGQQ RAIAMAKQYN TGFHTHCSEA EVEVGGFIDT YGKRPMHVLE DLGFFEAPRT MLAHAVWLDE AEIELIARYN VSVAHNPVSN MKLASGIAPI ADMLAAGIPV GLGTDGEKEN NNFDMFEEMK TASLLGKLRH RDAAAMDSWQ CLRMATILGA RAIGLEDEIG SIEVGKRADI IAVRTDTPRM TPLFADGPYF NVQHNLVHAV RGGDVAMTMV DGQVIVEDGV LKTGDVKAII ADIHGMAPSH FARRAAWLAE NGGGTKQWIS EAEGVK
|
| |