Gene Rpal_3152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3152 
Symbol 
ID6410822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3397536 
End bp3398831 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content67% 
IMG OID642713030 
Productprotein of unknown function DUF21 
Protein accessionYP_001992131 
Protein GI192291526 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCTCGG TGGAATTGGC AATCGTTGTC GCCTTGATCG TCGTCAACGG TCTGCTGTCG 
ATGTCCGAAC TGGCGATCGT CTCGTCACGC CCGGCGCGGC TGTCGATCCT GGCGCAGCGC
GGCGTCCGCG GCGCGCGCCA GGCGATGAAG CTGAGCGAAG ACCCCGGCCG GTTTCTCTCC
ACCGTCCAGA TCGGCATCAC CTTGGTCGGC GTGCTGTCCG GCGCGTTCTC CGGCGCGACG
CTGGGCCAGC GCCTGAGCGA CTGGCTGACA GCGTCCGGTG TGCCGTTCGC CGACATCATC
GGCTTCGGCC TGGTGGTGAC GCTGATCACC TACGCGACAC TGATCGTCGG CGAACTGGTG
CCGAAACAGC TGGCGCTGCG CGATCCCGAA GCGGTGGCGG TGAAGGTCGC GCCGGCGATG
GCGCTGCTCG CCAAGATCTC GCTGCCAGTC GTGGTCGTGC TCGACATCTC CGGCAAGGCG
ATGCTGGCGC TGCTTGGCCA GAGCGGCGAA CCTGAGGACA AAATCTCCGA AGAAGAAATC
CATAGCCTGG TGATGGAAGC CGAGACCGCC GGCATACTCG AGCCTGGTGA GCGCCAGATG
ATTGCAGGCG TGATGCGGCT CGGCGACCGC CCGGTCGGCG CGGTGATGAC GCCGCGTCCC
GAGGTCGACA TGATCGACCT GTCCGATCCG CCCGACCAGA TCCGCGCCAC TTTCGCGAGC
AGCCCGCATT CGCGGTTGCC GGCCACGGAT GGAGATCGCG ACGATCCGAT CGGCATTATC
CAATCCAAGG ACGTGCTCGA AGTCTATCTG CGCGGGGAGA CGCCGGACTT CCGGGCGCTG
GTGCGCGACG CGCCGGTGAT CCCGGCCTCC GCCGACGCGC GCGACGCACT AATCATGCTG
CGCAACGCCT CGGTCCATAT GGGGCTGGTG TACGACGAAT TCGGTGGCTT CGAAGGCGTG
GTCAGCACCG CCGATATTCT GGAGTCGATC GTCGGCGCGT TCAGCTCCGA AGACGGGCCG
CCGGAGCCCG CCGCAGTGCG CCGCGACGAC GGCTCGTACC TTGTCGCGGG GTGGATGCCG
GTCGACGAGT TCGGCGACCT GCTGGGCATG CCGGTGCCGG CGCAGCGCGA TTATCACACC
GTCGCCGGTC TGGTGCTGTC GCATCTCGGC GCGCTGCCGA GCGTCGGCGA CAAGTTCGAC
TTTCAGGACT GGCGGTTCGA GATCATGGAC CTTGATCACC GGCGGATCGA CAAGATCCTG
GCGAGCCGCC TGCCGGATGA CGAAGCCTCG CCATGA
 
Protein sequence
MLSVELAIVV ALIVVNGLLS MSELAIVSSR PARLSILAQR GVRGARQAMK LSEDPGRFLS 
TVQIGITLVG VLSGAFSGAT LGQRLSDWLT ASGVPFADII GFGLVVTLIT YATLIVGELV
PKQLALRDPE AVAVKVAPAM ALLAKISLPV VVVLDISGKA MLALLGQSGE PEDKISEEEI
HSLVMEAETA GILEPGERQM IAGVMRLGDR PVGAVMTPRP EVDMIDLSDP PDQIRATFAS
SPHSRLPATD GDRDDPIGII QSKDVLEVYL RGETPDFRAL VRDAPVIPAS ADARDALIML
RNASVHMGLV YDEFGGFEGV VSTADILESI VGAFSSEDGP PEPAAVRRDD GSYLVAGWMP
VDEFGDLLGM PVPAQRDYHT VAGLVLSHLG ALPSVGDKFD FQDWRFEIMD LDHRRIDKIL
ASRLPDDEAS P