Gene Rpal_2197 details

Gene Information

Locus tag: Rpal_2197
Symbol:
ID: 6409857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2379950
End bp: 2381338
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 66%
IMG OID: 642712081
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001991193
Protein GI: 192290588
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
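
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. Both are assumptions that happen to match the listed numbers, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2379950
end_bp = 2381338
gene_length_bp = 1389
protein_length_aa = 462


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS whose final codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```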


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGAAC GCTGGACACC CGATAGCTGG CGCGCCAAGC CGGTGCAGCA GATGCCGCAA 
TATCCCGACG CGAAGGCGCT TGCGGATGTC GAGGCGCAGC TCGCGAGCTT TCCGCCGCTG
GTGTTTGCGG GCGAGGCGCG CAACCTGAAG AAGGCGCTGG CCACGGTGGC GGCCGGCGAC
GCTTTCCTGC TCCAGGGCGG CGATTGCGCC GAGAGCTTCG CCGAGCACGG CGCCAACAAC
ATCCGCGACC TGTTCCGCGT CTTCCTGCAG ATGGCGATCG TCTTGACCTA CGCCGGCGCC
TCGCCGGTGG TGAAGGTCGG CCGCATCGCC GGCCAGTTCG CCAAGCCGCG CTCCGCTCCG
GTCGAGAAGC GCGACGGCGT CGAGCTGCCG AGCTATCGCG GCGACATCAT CAATGACGTC
GCGTTCACCG AGGAAGCCCG CGTACCCGAT CCGCGCCGGC AGATCGAAGC GTATCGCCAG
TCGGCCGCGA CGCTCAACCT GCTGCGCGCC TTCGCCAAGG GCGGCTACGC CAGCGTCGAG
AACGTCCATA GCTGGATGCT GCAGTCGGTC AGCGACAGTC CGCAGTCGAA GGCCTATGCG
GATCTCGCCG ATCGCGTTTC CGGCGCGCTG GATTTCATGC GCGCCTGCGG CCTGACCTTC
GCCGTCGATT CCTCGCTCGG CACCACCGAT TTCTACACCA GCCACGAAGC GCTGCTGCTC
GGCTACGAGC AGGCGATGAC CCGCGTCGAC TCGACCACGG GTGATTGGTA CGCGACCTCC
GGCCACATGC TGTGGATCGG CGATCGTACC CGTCAGCTCG ACCACGCCCA TGTCGAGTAT
TTCCGCGGTA TCAAGAATCC GATCGGGCTG AAGTGCGGTC CGTCGCTGAA GACCGACGAA
CTGCTCAAGC TGATCGACAT TCTCAATCCC GACAACGAGC CGGGCCGGCT GACGCTGATC
GGCCGTTTCG GCCATGAGAA GATCGGCGAG CACCTCCCGG CGATGGTTCG CGCCGTGAAG
CGCGAGGGCC GGACCGTGGT GTGGTCGTGC GATCCGATGC ACGGCAACAC CATCACGTCG
AACTCCGGCT ACAAGACCCG GCCGTTCGAC CGCATCCTGT CGGAAGTCCG TTCGTTCTTT
GCGGTCCATG CCGCGGAGGG GACTCATGCC GGCGGCGTGC ATCTGGAGAT GACCGGCCAG
AACGTCACCG AGTGTCTCGG CGGCGCCCGC GCCATCACCG ACGAAGACCT CAACAACCGC
TATCACACCG CCTGCGATCC CCGGCTGAAC GCCGAGCAGT CGATCGACAT GGCGTTCCTG
ATCGCGGACC TCCTGAAGCA GGGTCGGGCC GGCAAGGCCA GCCCGCTGCA GGCGGCGGCT
GGCCTCTGA
 
Protein sequence
MSERWTPDSW RAKPVQQMPQ YPDAKALADV EAQLASFPPL VFAGEARNLK KALATVAAGD 
AFLLQGGDCA ESFAEHGANN IRDLFRVFLQ MAIVLTYAGA SPVVKVGRIA GQFAKPRSAP
VEKRDGVELP SYRGDIINDV AFTEEARVPD PRRQIEAYRQ SAATLNLLRA FAKGGYASVE
NVHSWMLQSV SDSPQSKAYA DLADRVSGAL DFMRACGLTF AVDSSLGTTD FYTSHEALLL
GYEQAMTRVD STTGDWYATS GHMLWIGDRT RQLDHAHVEY FRGIKNPIGL KCGPSLKTDE
LLKLIDILNP DNEPGRLTLI GRFGHEKIGE HLPAMVRAVK REGRTVVWSC DPMHGNTITS
NSGYKTRPFD RILSEVRSFF AVHAAEGTHA GGVHLEMTGQ NVTECLGGAR AITDEDLNNR
YHTACDPRLN AEQSIDMAFL IADLLKQGRA GKASPLQAAA GL
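
The sequences above are printed in space-separated blocks of 10, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how one might strip the formatting, compute a GC fraction, and translate the gene with translation table 11. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
first_line = "ATGTCCGAAC GCTGGACACC CGATAGCTGG CGCGCCAAGC CGGTGCAGCA GATGCCGCAA"
dna = clean(first_line)

# The translation should match the first 20 residues of the protein sequence.
print(translate(dna) == clean("MSERWTPDSW RAKPVQQMPQ"))
```

Pasting the full gene sequence into `clean` and calling `gc_fraction` on the result gives a value to compare against the GC content field. This sketch ignores ambiguous bases such as N.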