Gene Rpal_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1941 
Symbol 
ID6409601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2093716 
End bp2094966 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content63% 
IMG OID642711827 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_001990939 
Protein GI192290334 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCCAG CCCCCTCGAA TGTCCGTTCG TGGTCGATGC GCGCTGTGGT CGCCGCGACT 
GCGATCGGAT TCGGTGCTTC GTCGGCGCTC GCTGCCGACC CGATCAAGAT CGGCGTGATC
GCTGAAGCGC AGGCTATCGC CGGCGCTTCC ATTCCGCAGG CTGCGCAGCT CGCCGCTGAA
GAAATCAACG CGAAAGGCGG CATCGACGGC CGCAAGATCG AGATCGTCAG CTACGACAAC
CACTCCTCGT CGGCCGATTC CGTGCGAGCG TTCCAGCGCG CCGTGAATGA AGACAAGGTC
AACGCGGTGA TCGCCAGCTA CATCTCGGAA GTTGTGCTGG CGCTGATGCC GTGGGCCTCG
CGGCTGAAAA CGCCGTTCGT CACGCCGGGC GCGGCGTCCA ACGAGATCAC CAAGGCGATC
AACAAGGACT ACGAGAAGAA CAAATACACC TTCCACGGCT ATCTCACCTC CGGCGAGCTT
GCCCAGTCGG TGTGCGATGC AGCGAAGGAC CTGTTGGTCG ACGCCCGCCA GATGAAGAGC
GCGGTGATCA TGAGCGAGGA CGCGGCCTGG ACCAAGCCGC TCGACGTCGG CTACCAGGAG
TGCCTGCCGA AGGTTGGGCT GAAGGTGCTC GACCACATCC GGTTCTCGCC CGACACCACT
GATTTCACGC CGATCTTCAA TAAGATCGAA GGCTCAAAGC CGGACGTGAT CATCACCGGC
ATCTCGCATG TCGGCGTCCA GCCGACGGTG CAGTGGAAGA ACCAGCAGGT GCCGATCCCG
ATGTTCGGCA TCGCCTCCCA GGCGACCAAC GAGACCTTCG GCAAGGACAC CAACAACGCC
TCCGACGGCG TGCTGTACCA GGGCGTGTCG GGCCCCGGCG TCGCAGTGAC CTCGAAGTCG
GTGCCGTTCG CCGAGAATTT CAAGAAGAAG TACGGCAACT ATCCGTCTTA CGCCGGCTAC
ACCGCCTATG ACGAGGTCTA TTACATCGCC GAAGCGGTGA AGCGCGCCGG CTCCACCGAC
GGCGAGAAGC TGGTCGAAGC GCTGGAAAAG ACCGACTACG AAGGCACCAT CGGCCGCGTC
CAGTTCTACG GCAAGGACGA GCCGTTCACC CACGGCCTGA AATACGGCAA GGGCCTGCTG
ACCGGTCTGA TGCTGCAATG GCAGGACGGC AAGCAGGTCG CGGTGTGGCC GCCGGAAGTG
GCCAAAGCCA AGATCAAGTT CCCGGCGTTC ATCAAGGCCG CCGCGAACTG A
 
Protein sequence
MSPAPSNVRS WSMRAVVAAT AIGFGASSAL AADPIKIGVI AEAQAIAGAS IPQAAQLAAE 
EINAKGGIDG RKIEIVSYDN HSSSADSVRA FQRAVNEDKV NAVIASYISE VVLALMPWAS
RLKTPFVTPG AASNEITKAI NKDYEKNKYT FHGYLTSGEL AQSVCDAAKD LLVDARQMKS
AVIMSEDAAW TKPLDVGYQE CLPKVGLKVL DHIRFSPDTT DFTPIFNKIE GSKPDVIITG
ISHVGVQPTV QWKNQQVPIP MFGIASQATN ETFGKDTNNA SDGVLYQGVS GPGVAVTSKS
VPFAENFKKK YGNYPSYAGY TAYDEVYYIA EAVKRAGSTD GEKLVEALEK TDYEGTIGRV
QFYGKDEPFT HGLKYGKGLL TGLMLQWQDG KQVAVWPPEV AKAKIKFPAF IKAAAN