Gene Rpal_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0449 
Symbol 
ID6408097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp484575 
End bp485720 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content67% 
IMG OID642710361 
ProductCBS domain containing protein 
Protein accessionYP_001989485 
Protein GI192288880 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGACG GCGACAGAGC CGAGAACGGT TCGCAGGCCT CGCTGCAGGA CTCCACGCGC 
GGCCAGTTAC CCGCGGTGGT GCATCAGGGC GAAGTGCTGC ATCCGCATGG CGGCAGTTGG
CTGATCCGTG CGATCCGTTC GCTGTTCGGC TGGAAGCCCG GCTCGGTGCG CGACGACCTG
CAGGTCGTGC TCGACACCAG CCCGCCCGAC GACACCGGCT TCTCGACGCT CGAGCGCACG
ATGCTGCGCA ACATCCTCGG GCTGCACGAT CGCCGGATCG CCGACGTGAT GGTGCATCGC
GCCGACATCG TCGCGATCAA GCAGGACATC CAGCTCGGCG AATTGCTCAG CCTGTTTCAG
GACGCAGCGC ATTCGCGGCT CGTGGTTTAC AACGAAACGC TCGACGATCC GGTCGGCATC
GTTCACATCC GCGACCTCGT GGCGTTCATG ACCGCAAAGG CGAAGGTGCC GCCGGCGACG
GTCGCCAAGC GCAAGAAGGC GCTGCCCGCG GGCCTCGATC TGCGCGCGAT CGATCTGAAG
ATGCCGCTGA CCGAAACCGG CATCATCCGC AAGCTGCTGT ATGTGCCGCC GTCGATGCGG
GCGATCGACC TGCTGGCGCA GATGCAGGCG GCGCGCATCC ATCTGGCGCT GGTGGTCGAC
GAATACGGCG GTACTGATGG CCTGGTCTCG ATCGAAGATA TCGTCGAACA GATCGTCGGC
GAGATCGATG ATGAACACGA CTCGACCGAG CCGCCGTCGA TCGTGCGCCA GGCCGACGGC
TCGTTCATCG CCGATGCGCG AGCCAGTCTG GAAGACGTCC GCGCCATGAT CGGCGATCAG
TTCGTCACCG GCGAAGCGGG CGAAGACGTC GAAACCCTCG GCGGTTACCT CGTCAACCAC
GTCGGCCGGC TGCCGGTTCG CGGCGAAGTG ATCGCCGGCC CCGGCACCTT CGAATTCGAA
GTGCTCGACG CCGACCCACG GCGGGTGAAG CGGTTGCGGA TCGGACCGCG CAAGGAACGC
CCCGCCCCGC GCACACGCGA CAGCCGGCGG CGCGAGACCG CGACCGATTC CGCCGCGCCG
CAGACTACTG ACAGCGGCGG ATCCACCTCC TCTCCACCTG CCGGCGACGG GACCGGTTCG
CCGTGA
 
Protein sequence
MPDGDRAENG SQASLQDSTR GQLPAVVHQG EVLHPHGGSW LIRAIRSLFG WKPGSVRDDL 
QVVLDTSPPD DTGFSTLERT MLRNILGLHD RRIADVMVHR ADIVAIKQDI QLGELLSLFQ
DAAHSRLVVY NETLDDPVGI VHIRDLVAFM TAKAKVPPAT VAKRKKALPA GLDLRAIDLK
MPLTETGIIR KLLYVPPSMR AIDLLAQMQA ARIHLALVVD EYGGTDGLVS IEDIVEQIVG
EIDDEHDSTE PPSIVRQADG SFIADARASL EDVRAMIGDQ FVTGEAGEDV ETLGGYLVNH
VGRLPVRGEV IAGPGTFEFE VLDADPRRVK RLRIGPRKER PAPRTRDSRR RETATDSAAP
QTTDSGGSTS SPPAGDGTGS P