Gene Rpal_4559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4559 
Symbol 
ID6412243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4910990 
End bp4912168 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content64% 
IMG OID642714439 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_001993528 
Protein GI192292923 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.61677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGATTG GCAGACGAAC GCTGCTGCAC GCCGCCTGCA TCGCTTTGGC CGGCGCAACG 
ACGAGCACGG TGGCGCATGC CGAGGACACC GTGAAGATCG GCCTGATCGT GCCGATGACC
AGCGGCCAAG CCTCGACCGG CAAGCAGATC GACAACGCCG TCAAGCTGTA CATGAAGCAG
AACGGCTCCA CCGTCGCCGG CAAGAAGATC GAAGTGATCC TGAAGGACGA CGCCGCGGTG
CCGGACAACA CCAAGCGGCT CGCGCAGGAA CTGATCGTCA ATGACAAGGT CAACGTGATC
GCCGGCTTCG GCATCACGCC CGCCGCGCTC GCAGCGGCGC CGCTCGCCAC CCAGGCGAAA
GTGCCCGAAG TGGTGATGGC GGCCGGCACC TCGATCATCA CCGAGCGCTC GCCCTATATC
GTCCGCACCT CGTTCACGCT GCCGCAGTCC TCGACCGTGA TCGGCGATTG GGCGGTAAAG
AACGGCATCA AGAAGGTGGT GACGCTGACC TCCGACTACG CGCCGGGCAA TGACGCGCTG
GCGGCGTTCA AGGAGCGCTT CACCGCCGGC GGCGGTCAGA TCGTCGAAGA GGTCAAGGTG
CCGCTCGCCA ATCCGGACTT CGCGCCGTTC CTGCAGCGCG CCAAGGACTC CAAGCCGGAC
GCGATGTTCG TGTTCGTTCC GGCCGGTCAG GGCGGCAACT TCATGAAGCA GTTTGCCGAG
CGCGGCCTCG ACAAGTCGGG CATCAAGGTG ATCGGCCCCG GCGACGTGAT GGACGACGAC
CTCTTGAACA GCATGGGCGA CGCCGCGATC GGCGTGGTCA CTGCGCACAT CTATTCGGCG
GCGCATCCGT CGGAGAAGAA CAAGGCGTTC GTCGCCGCCT ACAAGAAGGA ATTCGGCCAG
CGGCCCGGCT TCATGGCGGT CGGCGGCTAC GACGGCATCC ACCTGATTTA CGAGGCGCTG
AAGAAGACCG GCGGCAAGGC CGACGGCGAT TCGCTGATCG CCGCGATGAA GGGCATGGCT
TGGGAAAGCC CGCGCGGCCC GATCTCGATC GACCCCGAAA CCCGCGACAT CGTCCAGAAC
GTCTATATCC GCAAGGTCGA GAAGGTCGAT GGCGAGCTCT ACAACGTCGA GTTCGACAAG
GTCGACGCGG TGAAGGATCC GGGCAAGACG AAGAAGTAA
 
Protein sequence
MLIGRRTLLH AACIALAGAT TSTVAHAEDT VKIGLIVPMT SGQASTGKQI DNAVKLYMKQ 
NGSTVAGKKI EVILKDDAAV PDNTKRLAQE LIVNDKVNVI AGFGITPAAL AAAPLATQAK
VPEVVMAAGT SIITERSPYI VRTSFTLPQS STVIGDWAVK NGIKKVVTLT SDYAPGNDAL
AAFKERFTAG GGQIVEEVKV PLANPDFAPF LQRAKDSKPD AMFVFVPAGQ GGNFMKQFAE
RGLDKSGIKV IGPGDVMDDD LLNSMGDAAI GVVTAHIYSA AHPSEKNKAF VAAYKKEFGQ
RPGFMAVGGY DGIHLIYEAL KKTGGKADGD SLIAAMKGMA WESPRGPISI DPETRDIVQN
VYIRKVEKVD GELYNVEFDK VDAVKDPGKT KK