Gene Rpal_4549 details

Gene Information

Locus tag: Rpal_4549
Symbol:
ID: 6412233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4901465
End bp: 4902646
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 65%
IMG OID: 642714429
Product: Extracellular ligand-binding receptor
Protein accession: YP_001993518
Protein GI: 192292913
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
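
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 4901465
END_BP = 4902646


def gene_length_bp(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1182
print(protein_length_aa(length))  # 393
```

Under these assumptions both results agree with the listed gene length (1182 bp) and protein length (393 aa).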


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCGCT ACACGCATGT CCTGCGCAGC GTCGCCGTCG GCGCGACGGC TTTGGCGCTG 
TCGAGCTTTT CGCTGATGCA CAGCGCGCTC GCGCAATCCA CCATCAAGCT CGGCTCGGTG
CTGTCGATCA CCGGCCCCGC GTCCTTCCTC GGCGATCCAG AGGACAAGAC GCTGAAGATG
TATGTCGACA AGATCAACGC CGCGGGCGGC GTCGACGGCA AGAAGATCGA ACTCGTCGTC
TATGACGACG GCGGCGACGC CAACAAGGCC CGCACCTTCG CCACCCGCCT AGTCGAGGAC
GACAAGGTGG TGGCGATGAT CGGCGGCTCG ACCACCGGCA CCACGATGGC GATGATCCCG
GTGTTCGAGG AAGCGCAGAT CCCGTTCATC TCGTTTGCCG GCGCGGTCGA AATCATCGAC
CCCGTCCGCA AATACGTGTT CAAGACCCCG CACACCGACA AGATGGCGTG CGAGAAGATC
TTCGAGAACA TCAAGGCGCG CAAATTTACC AAGGTGGCGA TGATCTCCGG CACCGACGGC
TTCGGCTCGT CGATGCGGGC GCAGTGCCTG AAGGTCGCGG CGAACTACGG CGTCAGCATC
GTGGCCGAAG AAACCTACGG CCCGCGCGAC AGCGACATGA CGGCGCAGCT CACCAAGATC
AAGGCCGCCC CGGGCGTCGA GGCAGTGGTC AATCCCGGCT TCGGCCAGGG CCCGGCGATC
GTCACCCGCA ACTACGCGCA GCTCGGCATG TCGTCGACGC CGTTCTACCA GAGCCACGGC
GTCGCATCGA AGAGCTTCAT CGAGCTCGCC GGCCCGGCGG CCGAAGGCGT GCGGCTGCCC
GCCGCCGCGC TGCTGGTCGC CGACAAGCTG CCGGACAACG ATCCGCAGAA GAAGGTCGTC
ACCGAGTACA AGGCGACTTA CGAAGACACC ACCAAGCAGC CGGTCTCGAC CTTCGGCGGC
CACGCTTATG ACGGCCTCTA CATCCTGGTC GACGCGATGA AGCGGGCGAA GTCGACCGAT
CCGAAGAAAG TGCGCGACGA GATCGAGGCC ACCAAGGGCT TCGTCGGCAC CGGCGGCATC
GTCACCATGT CACCGACCGA TCACCTCGGC CTCGACCTGT CGGCGTTCCG GATGCTCGAG
GTCAAGAACG GCGACTGGAC GCTGGTGCAG CCGGGAAGCT GA
 
Protein sequence
MSRYTHVLRS VAVGATALAL SSFSLMHSAL AQSTIKLGSV LSITGPASFL GDPEDKTLKM 
YVDKINAAGG VDGKKIELVV YDDGGDANKA RTFATRLVED DKVVAMIGGS TTGTTMAMIP
VFEEAQIPFI SFAGAVEIID PVRKYVFKTP HTDKMACEKI FENIKARKFT KVAMISGTDG
FGSSMRAQCL KVAANYGVSI VAEETYGPRD SDMTAQLTKI KAAPGVEAVV NPGFGQGPAI
VTRNYAQLGM SSTPFYQSHG VASKSFIELA GPAAEGVRLP AAALLVADKL PDNDPQKKVV
TEYKATYEDT TKQPVSTFGG HAYDGLYILV DAMKRAKSTD PKKVRDEIEA TKGFVGTGGI
VTMSPTDHLG LDLSAFRMLE VKNGDWTLVQ PGS
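
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method the database itself uses. It assumes the sequence is pasted in the 10-base blocks shown above (whitespace is stripped) and builds the codon table from the NCBI-style amino acid string for translation table 11. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Alternative start codons are not treated specially, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table for translation table 11, with bases in NCBI order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGTCTCGCT ACACGCATGT CCTGCGCAGC"))  # MSRYTHVLRS
print(translate("CCGGGAAGCT GA"))                     # PGS
```

The two printed fragments match the first ten and last three residues of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the listed GC content of 65%.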