Gene Rpal_3393 details


Gene Information

Locus tag: Rpal_3393
Symbol:
ID: 6411067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3631744
End bp: 3633375
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 66%
IMG OID: 642713273
Product: protein of unknown function DUF935
Protein accession: YP_001992370
Protein GI: 192291765
COG category: [S] Function unknown
COG ID: [COG4383] Mu-like prophage protein gp29
TIGRFAM ID:
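
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3631744, 3633375)
print(length, protein_length_aa(length))  # prints "1632 543"
```

Both results agree with the Gene Length and Protein Length fields in the record.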


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.219922
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGACG CCCCGATCAT CTACGGTCCC GACGGCCAGC CTATCCGTCG CGAGGTCCTG 
ACCGCCTCTG TCGCCGGTCC GACCGTCACC GGCGTGCGCT CGCCGTTCGG CGCCTATCCG
GCCGACGGGC TTAATCCGCG GCGACTGGCC TCGATCCTGC GCGAGGCCGA TCAGGGCGAC
CCGATCAGTT ATTTCGAGCT GGCGGAGCAG ATCGAGGAAC GAGACCCGCA CTATCTCGGC
GTGCTGTCGA CGCGCAAGCG CAGCGTCTCT CAGCTCGATA TCACAGTGGA ATCGGCCGAC
GATACGCCCG AAGGAAAGTC GATCGCCGAC ATGGTCGAGG CCTGGCTGCG GCGCGACGAA
TTGCAGTTCG AGCTGTTCGA CATTCTCGAC GCGATCGGCA AGGGCTGGAG CCAGACCGAA
ATCATCTGGG ACACATCCGA GGGCCAGTGG CAGCCGAAGC GGCTGGAGTG GCGCGACCCT
CGTTGGTTCA GGCCGGATCA GCGCGACGGT GTGACGCCGC TACTGCGGAT CGACGCCGGC
GACGCCTTCG CGCAGGGGCT ACCACCGGCC GGACCGAACG GCGGCGGCTA CGCGCCATTG
CCGCCGTTCA AGTTCATCTC GGCGGTGATC CGTGCCAAGA GCGGCCTTCC GGTCCGCTCG
GGCCTCGCGC GGCTGGCGTG CTGGTCATGG ATGTTCAAGG CGTTCACGCA GCGTGATTGG
GCTGTGTTCA CGCAGACGTT CGGCCAGCCG GTGCGGGTCG GCAAGTATCC GGCTGGATCG
TCCGAGAAGG ACAAGGACAC GCTGTTCCGC GCCGTCGCCA ATATCGCCGG CGACTGCGCG
GCGATCATTC CGGAATCGAT GCTGATCGAG TTCATTGAGT CCGCAAACGT CGGTTCAAGC
CATCAGCTCT ACAAGGAGCG CGCGGACTGG CTCGATCAGC AGATGTCCAA GGCCGTGCTT
GGCCAGACCG CCACCACCGA CGCGATTGCC GGCGGCCACG CGGTCGGTCA GGAGCATCGG
CAGGTGCAGG AGGATATCGA GACCGCCGAC GCGAAGGCGC TGGCGGCGAT CATCAACCGC
GACCTCGTGC AGTCCTGGGT GCAGCTCGAA CACGGGCCGC AGAAGGTCTA TCCGCGCCTG
CGGATCGGCC GGCCAGAGAG CAAGAACGTG ACGCAGATCC TCGACGGTAT CAGCCGCGGC
GTGCCGATGG GGATGGCGGT CGAAAAGAGC TACATGAACG ATCTGCTCGG CATCCCGGTG
CCAAGCCCCG GCGCGGACGG CCGCATGCCG GAGCTGCTGA CGCCCTCGGC GTCGGCGTCG
CCGTTCGGAT CGATGTTTCC CTCGGCATCA CCTCAGCAGC GCGCGCTCGC TGCGGCCGAG
ATGATCATGC ACGACGTGCG CGATCCGATC GCTGTGCTAT CCGATCAGGC CGCACGACTA
TGCGCGCCGG GAAGCGATGC GCTGGTCGAC GAAGTGCGCG GCGTGATCGA AACATCGACG
TCGCTGCAGC AGGTGCAGGA GAAGCTGCGC GCGCTGAAGC CGGGCGCCGC CGAAGCACAG
ATGGCCGGGC TGATGCGGAT GGCGCGGGTG ATCGCGAATC TGACCGGCCG CGCCAGCATT
CCCGATGCTT AA
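
The gene sequence is printed in space-separated blocks of ten bases, so it needs light cleanup before any computation. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed. The helper names are hypothetical, and only the first row of the sequence is used here as sample input, so the printed value is not the whole-gene figure reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, as sample input only.
first_row = "ATGGCCGACG CCCCGATCAT CTACGGTCCC GACGGCCAGC CTATCCGTCG CGAGGTCCTG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two helpers gives a value to compare with the GC content field in the Gene Information section.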
 
Protein sequence
MADAPIIYGP DGQPIRREVL TASVAGPTVT GVRSPFGAYP ADGLNPRRLA SILREADQGD 
PISYFELAEQ IEERDPHYLG VLSTRKRSVS QLDITVESAD DTPEGKSIAD MVEAWLRRDE
LQFELFDILD AIGKGWSQTE IIWDTSEGQW QPKRLEWRDP RWFRPDQRDG VTPLLRIDAG
DAFAQGLPPA GPNGGGYAPL PPFKFISAVI RAKSGLPVRS GLARLACWSW MFKAFTQRDW
AVFTQTFGQP VRVGKYPAGS SEKDKDTLFR AVANIAGDCA AIIPESMLIE FIESANVGSS
HQLYKERADW LDQQMSKAVL GQTATTDAIA GGHAVGQEHR QVQEDIETAD AKALAAIINR
DLVQSWVQLE HGPQKVYPRL RIGRPESKNV TQILDGISRG VPMGMAVEKS YMNDLLGIPV
PSPGADGRMP ELLTPSASAS PFGSMFPSAS PQQRALAAAE MIMHDVRDPI AVLSDQAARL
CAPGSDALVD EVRGVIETST SLQQVQEKLR ALKPGAAEAQ MAGLMRMARV IANLTGRASI
PDA