Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3091 |
Symbol | |
ID | 4898192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 106023 |
End bp | 107303 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640113693 |
Product | allantoate amidohydrolase |
Protein accession | YP_001044963 |
Protein GI | 126463850 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.60133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.601215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC TGCCCCTGGC TGAGGACGCC CCCGCCCGCG CCATCCGCAG CACCATCGAC CGGGTGCTGG CCGAGGTGAA CGCCCTCTCC GAGGGCGGCC CCGGCTGGAC CCGCCCCTCC TATTCCGATC TGGAAAGCCA TGCCCATGCG CTGATCGAGG CCGAGGCCCG CGCGCTCGGC TTGAGCGTCA GCCGCGATCA TGCAGGCAAC CTCTTCGCCC GGATGGAGGG GCGCGACCCG AGCCTTCCGG CCCTCCACTG CGGCTCGCAC CTCGACACGG TGGCCGAGGG CGGCGCCTTC GACGGGCAGG CGGGCGTGGC CGCGGCGCTG GCCCTCGTCG CGGCCATGCG CGAGGCGGGC GTCACGCCCG AGGCCGATTT CGTGCTGACC GTGACGCGGG CCGAGGAGAG CGTCTGGTTC CCCGTCTCCT ATATCGGCTC GCGCGCGGCG CTCGGCCGGC TCCTGCCCGA AGAGCTCGAG GCGCGCCGCG TCGACACCGG CCGGACGCTG GCCGAGCACA TGCGCGAGCA GGGCTTCGAC CCCGACGCGC TGATGCGGGC CGAGCCGCCG AAGCCCGCGC GCTTCCTCGA GTTCCATATC GAGCAGGGCC CCGTCCTCGA CCGGGCGGGC GAGCCCTACG GCATCGTCTC GGCCATCCGC GGCGGTCTGC GCTATCGCGC GGCAAAGGTG CATGGCACCT GGGCCCATTC CGGCGGTGCG CCGCGCGCGG GCCGCGCCGA CGCGGTGGTG GCCTTCGCCG ATCTGGTCAT GGCGATGGAC CGCGCCTGGG AGACGTTCCT GTCGCGCGGC GCGGACCTCA CCGTGACCTT CGGCAAGGTC GATGCAGCCT CGCCCGCCCA TGCCATGGCC AAGGTGCCGG GCGAGCTCGC CTTCTGCCTC GACCTGCGCT CCGAGGATGT GACGGTGCTC GAGGCGGCCG ACCGGGTGCT GCGCGAGGAG ATCCGCCGCA TCGAGACCGA ACGGCCCGGC ATCCGCTTCG ATCTGGGCAC CCAGAGCCGC AGCCAGCCTG CGCGTCTGTC GCCTGCGATG ATCGACTGGG TGGCGGGCGG CGCCGCGCGC CGGGGCGATG AGCCGCGCCG GATGCTCTCG GGCGGCGGCC ACGATGCGGC GGCCTTCGCC AGCGCCGGCT GGGACAGCGT CATGGTCTTC ATCCGCAACT GGAACGGCAG CCATTGCCCC GACGAGGGGA TGGATCCGGC CGACCTCGCC CGCGCGGTGG AGGCGGTCTT CGCCGCGCTG TCGGGGAACG GCTCCCCATG A
|
Protein sequence | MTDLPLAEDA PARAIRSTID RVLAEVNALS EGGPGWTRPS YSDLESHAHA LIEAEARALG LSVSRDHAGN LFARMEGRDP SLPALHCGSH LDTVAEGGAF DGQAGVAAAL ALVAAMREAG VTPEADFVLT VTRAEESVWF PVSYIGSRAA LGRLLPEELE ARRVDTGRTL AEHMREQGFD PDALMRAEPP KPARFLEFHI EQGPVLDRAG EPYGIVSAIR GGLRYRAAKV HGTWAHSGGA PRAGRADAVV AFADLVMAMD RAWETFLSRG ADLTVTFGKV DAASPAHAMA KVPGELAFCL DLRSEDVTVL EAADRVLREE IRRIETERPG IRFDLGTQSR SQPARLSPAM IDWVAGGAAR RGDEPRRMLS GGGHDAAAFA SAGWDSVMVF IRNWNGSHCP DEGMDPADLA RAVEAVFAAL SGNGSP
|
| |