Gene Rpal_1598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1598 
Symbol 
ID6409255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1709871 
End bp1711115 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content63% 
IMG OID642711487 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_001990602 
Protein GI192289997 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCATTT CCCGACGTTC ATTCGGAATC GGTGCGACTG GTCTGGTGCT CGGAACGGTC 
GCAGCGCCTT GGGTTCGCAA CGCCAGCGCA GAGCCGGCCC CGATCAAAAT CGGCGTGATC
AATTCGATGA GCGGCGGTCT GTCGGCCTAC GCGCAGGAAG GCAAGCCGGC GTTCGATTAC
ATCATCGATC AGATCAACAA GAGCGGCGGC ATCAAGAGCA AGGACGGCGC CAAGATTCAG
TTGCTGCAGG CCGACGACGC CAGCCAGCCG GCGCGGACCG CCACCGAAGC GCGCCGCCTG
ATCACCGAAG AAAAGGTGCC GCTGCTGACC GGCACCATCC TCAGCGCACA GATGCTGGCG
CTGACGCCGG TGCTCGACGA ATTGAAGGTG CCGACTCTGT CGATCTGGGC TGGCGGCGCC
AGGTCGAGCT ACATGTTCTC GCTCGGTTAT CCGTATGACC GCGGCTACGC GCAATCGATG
CACGATTTCA TCGTGTCGCT GCGCGACAAC GATAAGTTCC CGATCAAGAC CGCGGTGATG
TGCTACTCGA ACTACGAGGC CGGCCAGCAG GTCAACAAGT TCCTGATCGA GAAGCTGAAG
GCCAGCGGCA TCGAGGTGAT CGGCGAAGCG CCGCTCGACA CCAAGGCGCA GGACCAGACC
TCGGCGATGA TCCGCATCCG CTCGCTGAAG CCGGACGTCG TCACCGGACT GGTGACACCG
CGCGACGGCA TTCTGATGCA TCAGGCGCGC TACAACCTCA ACTATCAGGG CAGCCTGTTC
GTCGGCGGCA CCGGCGGTTA TTCGGACCTG TCGCTGTGGA AGGATCTCGG CCCCGAGATC
GGCAAGGCGG TGCTGACGCG CAACCTGTTC GGCATGACCG GCTTCAGCGC CGGCGCCAAG
ATGGACTCAA TGCAGAAGAT CATCACCGAG CTGCGCGACG TTGCCAAGCT CGAGCGCATC
GGCCAGGGCG CGGTTCAGTA TGCCCAGGGC GCGCGCGTGC TGCAGCAGGT GCTTGAGAAC
GCCAAGTCGC TGGAGCCGGA CGCGCTGCTC GAGGCGTTCA AGAGTTTCAA GATCCCGTTC
GGCGATCCGC ATCTCTACAT CGCCAAGCCG AAGGGCCTGC AGTTCGCCGA GGACCGGCTG
CTGACCGACG GTTCAGCGAT GATGATCCAG TGGATGCCGG ATCAGAGCCA GGAGGTCGTG
TTCCCGAAGG AGTTCGCACA GGCAGCTCCG CGTCCCAAGA GCTGA
 
Protein sequence
MSISRRSFGI GATGLVLGTV AAPWVRNASA EPAPIKIGVI NSMSGGLSAY AQEGKPAFDY 
IIDQINKSGG IKSKDGAKIQ LLQADDASQP ARTATEARRL ITEEKVPLLT GTILSAQMLA
LTPVLDELKV PTLSIWAGGA RSSYMFSLGY PYDRGYAQSM HDFIVSLRDN DKFPIKTAVM
CYSNYEAGQQ VNKFLIEKLK ASGIEVIGEA PLDTKAQDQT SAMIRIRSLK PDVVTGLVTP
RDGILMHQAR YNLNYQGSLF VGGTGGYSDL SLWKDLGPEI GKAVLTRNLF GMTGFSAGAK
MDSMQKIITE LRDVAKLERI GQGAVQYAQG ARVLQQVLEN AKSLEPDALL EAFKSFKIPF
GDPHLYIAKP KGLQFAEDRL LTDGSAMMIQ WMPDQSQEVV FPKEFAQAAP RPKS