Gene RPB_1143 details


Gene Information

Locus tag: RPB_1143
Symbol:
ID: 3909231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1313352
End bp: 1316315
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 69%
IMG OID: 637883037
Product: sarcosine oxidase alpha subunit family protein
Protein accession: YP_484764
Protein GI: 86748268
COG category: [E] Amino acid transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
        [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
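
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1313352, 1316315)
print(length)                     # 2964
print(protein_length_aa(length))  # 987
```

Both values agree with the Gene Length and Protein Length fields of the record.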


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.983501
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGG CTTCACGCAT CAGCGGCGGC CTGATTGATC GCGACAAGCC GCTGCGATTT 
TCCTTCGACG GCACGGCGAT GACCGGCTTC GCCGGCGACA CGCTGGCCTC GGCGCTGGTG
GCCAACGGCA CCCGCCTGGT CGGCCGCTCG TTCAAATATC ATCGGCCGCG CGGCATTTTC
TCCGCCGGCT CCGAAGAGCC GAACGCGCTG GTCGAATTGC GCAGCGGTGC GCGGCGTGAG
CCCAACACCA AGGCGACCAC GGTCGAGCTC TATGACGGCC TCGAGGCGCA CAGCCAGAAC
CGCTGGCCGT CGCTGGCGTT CGATTGGCGC GCGGTGCATC AGCTCGCGTC GCCGCTGATC
GTCGCCGGCT TCTACTACAA GACCTTCATG TGGCCGGCCG CGTTTTGGGA AAAGCTCTAC
GAGCCGCTGA TCCGCCGCTC CGCCGGGCTC GGCCGCCTCA GTGGCGAGCC CGATCCGGAC
ACATATGAGA AAGCCACCGC GTTCTGCGAC CTGCTGATCA TCGGCGGCGG CCCCGCGGGC
CTTGCCGCGG CGCTGGCGGC GGGGCGTGCC GGCGCGCGCG TGATCCTGGT CGAAGAAGAC
TTTGCGCTCG GCGGCCGGCT GCTGTCCGAG CTTTGCGAGA TCGACGGACT GTCGGGGGCC
GGGTGGGCAC AACTGGCGGA AGCCGAACTC GCAACCCTGA GTAATGTCCG GATCCTGCGC
CGGTCCAGCG TGTTCGGCGT CTATGATGAT GAGTTCGGCG TGATCGAGCG CGTCGCCGAT
CATCTGCCGG TGCCGCCGGC GTTTACGCCG CGGCAGCGGC TGTGGAAGAT CGTGGCGCGG
GAGTCGTTGC TGGCGACCGG CGCGACCGAG CGGCCGATCG TGTTCGGCGG CAACGACCGG
CCCGGCGTGA TGCTGGCCTC CGCTGTGCGG AGCTACGTCA ACCGCTTCGC CGCAGCGCCG
GGACAGCGCG CGGTGGTGTT CACGACCAGC GACGACGGCT GGCGCAGCGC GGCTGATCTG
TCGCGCGCCG GAATTGTAGT CGCTGCCGTA GTCGATCCGC GCCGCGAGGT CGCCGCTTCC
ATCCGCGCGT TGGCCGGCAA CGCGCCGGTG CATCTTGGCG CGTCGGTCAC CGACGCCATC
GGCGGGCAAT CGCTGCGTGC GGTCGAGATC GTCGATGCCG CCGGCAAACG GCAAAAACTC
GCCGCCGATC TGCTCGCTGT GTCCGGCGGC TGGAATCCGA ACATCGCGCT CGCCACCCAT
CTCGGCGGCA AGGGCGAATG GAATCCCGAG ACATCGGCGT TTCTCGCGGC CGGCGCGCCG
AAGGCGATGA CCATCGCGGG TGCCGCGGCC GGCCGTTTCA CGCTGGCGCA GGCGTTGGAG
GACGGCGCGC GCTGGGGCGC GGAGGCCGCA TCGCGTTGCG GCCATGCCGG CGCCGCGCAG
CCGGCCTATC GCGCCAGCGA CGAAGCTTTC GCCGTGACGC CGCTGTGGCA GGTCGCAGGC
GCCCGCAGCA AGGCCTTCGT CGATCTGCAG AACGACGTCA CCGCCGCCGA TATCGCGCTC
TCGGCGCGCG AGGGCTTTCG CTCGGTCGAG CACCTGAAGC GCTACACCAC GCTCGGCATG
GCGACCGACC AAGGCAAGAC CTCCAACGTC AACGGCCTGG CGATGATGGC GGCGCTCACC
GAGCGCAGCA TCGCCGCCGC AGGCACCACG CGGGCCCGGC CGCCGCAGGT GCCGGTCGCG
ATCGGCGCAT TCGGCGGCCT CAGCACCGGC AAGCATTTCA AGCCGACGCG TCTCACCGCG
ACTCACGACT GGTCCGCCCA GCAGGGCGCG AGTTTCGTCG AGACCGGGCA GTGGTTGCGC
GCGCAATGGT TCGCACGGCC GGGCGAGACC GACTGGCTGC AAAGCGTGTC GCGCGAAGTC
GATGCCGTGC GGAGCGCGGT CGGGATTTGC GACGTCTCCA CCCTCGGCAA GATCGCGCTG
TGCGGCGCCG ACGTCGGCGT GTTTCTCGAC CGAGTCTACA TCAACACCTT CTCGACGCTG
GCGGTCGGCA AGGTGCGCTA TGGCGTGATG CTGCGCGAGG ACGGCTTCGT CATGGACGAC
GGCACAACCG CGCGGCTCGC CGAGGATCAC TACGTGATGT CGACCACCAC CGCGAACGCG
GTGAAGGTGA TGCAGCATCT CGAATTCTGC CATCAGGTGC TGTGGCCCGA GCTCGACGTG
CAGATGGTCT CGGTCACCGA GCAATGGGCG CAGGTCGCGG TCGCCGGGCC TCGGTCCCGC
ACGCTTCTGC AGAATCTGTT CGGGCCGGGT GTCGATCTGT CGGATGCGGC GTTTCCCTAT
ATGGCGTGCG GCGAATTCCG CCTCGGCGAG GTGCCGGCGC GGCTGTTCCG GATCTCGTTC
TCCGGCGAGC GCGCCTACGA GATCGCAGTG CCGGCCGGCT ATGGCGATGC GCTGATGCGC
GCGCTGATGG CGGCGGGTGA AGGCCTCGGC GTCGTGCCCT ACGGCACCGA GGCGCTCGGC
GTGATGCGGA TCGAGAAGGG CCACGCCGCC GGCAATGAAC TCAACGGCCA GACGGTGGCG
CGCGATCTCG GCCTCGGCCG GATGATGTCG ACGAAGAAAG ACTTCATCGG CCGGGTGATG
GCGGGCCGGC CCGCGCTGAT CGATCCGGCG CGGCCGACGC TGGTCGGCCT GCGTCCGGTC
GATCGCAACG ACCGCCTGCG CAACGGCGCG CATCTGTTCG CGCCCGGCGC AGCGCCGTCG
CCGGAGACCG ATCAGGGCTT CGTCACGTCG TCGGCGTTCA GCCCGTCGCT CGGCCACTGG
ATCGCGCTGG CGCTGCTGTC GCGCGGTCCG GATCGGATCG GCGAACGTAT TCGCGTCTAC
GATCCGATCC GCGCGCATGA TTTCGAGGCC GAGATCGTGT CGCCGGTGTT TGTCGATCCG
GAAGGAGAGC GGCTGCGTGG CTGA
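
The gene sequence is printed in space-separated blocks of 10 bases, so it needs light cleaning before any computation. The sketch below shows one way to strip the whitespace and compute a GC fraction; the helper names are hypothetical. It is demonstrated on the final line of the sequence only, so its result is not expected to reproduce the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


last_line = "GAAGGAGAGC GGCTGCGTGG CTGA"
print(len(clean_sequence(last_line)))    # 24
print(round(gc_fraction(last_line), 3))  # 0.667
```

Passing the full multi-line gene sequence as one string to `gc_fraction` would give the value to compare against the GC content field.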
 
Protein sequence
MSAASRISGG LIDRDKPLRF SFDGTAMTGF AGDTLASALV ANGTRLVGRS FKYHRPRGIF 
SAGSEEPNAL VELRSGARRE PNTKATTVEL YDGLEAHSQN RWPSLAFDWR AVHQLASPLI
VAGFYYKTFM WPAAFWEKLY EPLIRRSAGL GRLSGEPDPD TYEKATAFCD LLIIGGGPAG
LAAALAAGRA GARVILVEED FALGGRLLSE LCEIDGLSGA GWAQLAEAEL ATLSNVRILR
RSSVFGVYDD EFGVIERVAD HLPVPPAFTP RQRLWKIVAR ESLLATGATE RPIVFGGNDR
PGVMLASAVR SYVNRFAAAP GQRAVVFTTS DDGWRSAADL SRAGIVVAAV VDPRREVAAS
IRALAGNAPV HLGASVTDAI GGQSLRAVEI VDAAGKRQKL AADLLAVSGG WNPNIALATH
LGGKGEWNPE TSAFLAAGAP KAMTIAGAAA GRFTLAQALE DGARWGAEAA SRCGHAGAAQ
PAYRASDEAF AVTPLWQVAG ARSKAFVDLQ NDVTAADIAL SAREGFRSVE HLKRYTTLGM
ATDQGKTSNV NGLAMMAALT ERSIAAAGTT RARPPQVPVA IGAFGGLSTG KHFKPTRLTA
THDWSAQQGA SFVETGQWLR AQWFARPGET DWLQSVSREV DAVRSAVGIC DVSTLGKIAL
CGADVGVFLD RVYINTFSTL AVGKVRYGVM LREDGFVMDD GTTARLAEDH YVMSTTTANA
VKVMQHLEFC HQVLWPELDV QMVSVTEQWA QVAVAGPRSR TLLQNLFGPG VDLSDAAFPY
MACGEFRLGE VPARLFRISF SGERAYEIAV PAGYGDALMR ALMAAGEGLG VVPYGTEALG
VMRIEKGHAA GNELNGQTVA RDLGLGRMMS TKKDFIGRVM AGRPALIDPA RPTLVGLRPV
DRNDRLRNGA HLFAPGAAPS PETDQGFVTS SAFSPSLGHW IALALLSRGP DRIGERIRVY
DPIRAHDFEA EIVSPVFVDP EGERLRG
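
The relationship between the two sequences can be illustrated by translating the gene sequence codon by codon. This is a minimal sketch, not the database's own method: it builds the codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons), stops at the first stop codon, and uses hypothetical function and variable names.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these assignments with the standard genetic code.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence from this record.
print(translate("ATGAGCGCGG CTTCACGCAT CAGCGGCGGC"))  # MSAASRISGG
print(translate("GAAGGAGAGC GGCTGCGTGG CTGA"))        # EGERLRG
```

The two outputs match the first ten and last seven residues of the protein sequence listed above.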