Gene Rpal_1945 details


Gene Information

Locus tag: Rpal_1945
Symbol:
ID: 6409605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2098215
End bp: 2099465
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 66%
IMG OID: 642711831
Product: allantoate amidohydrolase
Protein accession: YP_001990943
Protein GI: 192290338
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01879] amidase, hydantoinase/carbamoylase family
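
The start, end, gene length, and protein length fields are related by simple arithmetic. The Python sketch below shows that relationship. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The record does not state either convention, but both are consistent with the values listed. The function names are hypothetical and chosen for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2098215, 2099465)
print(length, protein_length_aa(length))  # prints "1251 416"
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.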


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAAGA CTGCCTCGAA CCTGCAGATC GATTCCTCGC GGCTGTGGGA CACTATCGTC 
AGCACCGCGC AGTTCGGCGG CACTCCGAAA GGCGGCGTGA AGCGCCTGAC GCTGTCGGCG
GAAGACAAGC AGGTGCGCGA CTGGTTTCGC CAGGCGTGCG AGCAGGCAGG CCTCGAGGTC
AGCATCGATT CACTCGGCAA CATGTTCGCG CTACGCAAGG GCCGCGACAT GACCAAGCCA
CCGATCGGCC TCGGCTCGCA TCTCGATACC CAGCCGACCG GCGGCAAGTT TGACGGCATC
CTCGGCACCC TCGCCGCTCT CGAAGTGATC CGGACCCTCA ACGACGCCGG CATCGAGACC
GAGCTGCCGC TGTGCATCAC CAACTGGACC AATGAGGAAG GCTCGCGATT CGCCCCGGCG
ATGATGGGCT CGGCGGCGTT CGTCGGCGAC TTCACCGTCG AGGACGTGCT GTCGCGCAGG
GATGCGGCCG GCATCAGCGT CGCCGAAGCG CTCGACAGCA TCGGCTATCG CGGCGACAAA
CTGGTTGGCG CCCAACCGTT CACCGGCTTC ATCGAGCTGC ATATCGAGCA GGGCCCGATC
CTGGAGGCGG AAAGCAAGAC CATCGGCGTG GTCGATCACG GCCAGGGCGT GCTGTGGTAC
GACGGCAAGA TCACCGGCTT CGAAAGCCAT GCCGGATCGA CCCCGATGCA TCTGCGCCGC
GACGCGCTGG CGACGCTGTC GGAGATCGTG CTGGCGGTCG AGAAGATCGC AACCGAACTC
GGCCCCAATG CCGTCGGCAC TGTCGGCGAA GCGGTGATCG CCTCCCCGTC ACGCAACGTC
ATTCCCGGCG AGATCGCCTT CACCATCGAC ATGCGCAGCG CCGATGCGGC GATCATGGAT
CAGCTCGACC AGCGGCTGCG CGCCGCGATC GCCGAGATCG CGCCGCGGCG CAAGGTCGAG
GTCGCGCTCG ATCTGGTGTG GCGCAAGGAG CCGACGCACT TCGATCCTGC CCTGGTCGGC
AGCGTCGAGA ACGCCGCCAA CGCCCTCGGC TATCAGAACC GCCGCATCAC CTCCGGCGCC
GGCCACGATG CCTGCAACCT CAACACCAGA ATCCCGACCG CGATGATCTT CGTGCCCTGC
AAGGACGGCA TCAGCCATAA CGAGTTGGAG GACGCGACCC AGCCCGACTG CGCCGCCGGT
GCCAACGTGC TGCTGCACAC CGTGCTGTCA CTCGCCGGCG TCGCCAAGTA A
 
Protein sequence
MTKTASNLQI DSSRLWDTIV STAQFGGTPK GGVKRLTLSA EDKQVRDWFR QACEQAGLEV 
SIDSLGNMFA LRKGRDMTKP PIGLGSHLDT QPTGGKFDGI LGTLAALEVI RTLNDAGIET
ELPLCITNWT NEEGSRFAPA MMGSAAFVGD FTVEDVLSRR DAAGISVAEA LDSIGYRGDK
LVGAQPFTGF IELHIEQGPI LEAESKTIGV VDHGQGVLWY DGKITGFESH AGSTPMHLRR
DALATLSEIV LAVEKIATEL GPNAVGTVGE AVIASPSRNV IPGEIAFTID MRSADAAIMD
QLDQRLRAAI AEIAPRRKVE VALDLVWRKE PTHFDPALVG SVENAANALG YQNRRITSGA
GHDACNLNTR IPTAMIFVPC KDGISHNELE DATQPDCAAG ANVLLHTVLS LAGVAK
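
The gene and protein sequences can be cross-checked against each other and against the GC content field. The Python sketch below removes the spacing used in the sequence blocks, computes GC percentage, and translates codons until the first stop. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCAAGA CTGCCTCGAA CCTGCAGATC GATTCCTCGC GGCTGTGGGA CACTATCGTC"
print(translate(first_line))  # prints "MTKTASNLQIDSSRLWDTIV"
```

The 20 residues printed match the start of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.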