Gene Rpal_4000 details

Gene Information

Locus tag: Rpal_4000
Symbol:
ID: 6411682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4286872
End bp: 4288083
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 63%
IMG OID: 642713882
Product: Extracellular ligand-binding receptor
Protein accession: YP_001992971
Protein GI: 192292366
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
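
The start, end, gene length, and protein length fields above are consistent with each other. As a rough sketch, the check below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4286872
END_BP = 4288083


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count, assuming an unspliced CDS with one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1212
print(protein_length(length))  # 403
```

Both results match the Gene Length (1212 bp) and Protein Length (403 aa) fields.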


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATTGT TTCGGCATAC AACGATCGTC GCCGCGGCTG CGCTGCTTGC AGTGCTCAGC 
GCCGGCCCAG CGCTCGCCGG CGGCAGCTAT GACCCCGGCG CCAGCGACAG CCTGATCAAG
CTCGGCCAGA CCATGCCGTA TTCGGGACCG GCGTCGGCCT ATTCGACGAT TGGGCGCGCC
GAGGCCGCTT ACTTCAAGAT GCTGAACGAC AAGGGCGGCA TCAACGGCCG CAAGGTCGAG
TTGCTGAGCC TCGACGATGC CTACTCGCCG TCGAAGACGG TGGAGCAGGT CCGCCGGCTG
GTCGAAAGCG ACGAAGTGCT GGCGATGTTC TCGATCCTCG GCACCGGGCC GAACATCGCG
GTGCAGAAGT ATCTCAATAT CAAGAAGGTC CCGCAGCTAT TGCCGTTCAG CGGCGCCACG
CGCTGGAACG ACGCCAAGCA CTTCCCGTGG ACCACCGGCT CGCAGCCGAC CTACAAGACC
GAGGGCAGGA TCTACGCGAA GTGGATTCTC GCCAACAAGC CGAATGCTAA AATTGCGGTG
ATCACGCCGG CCGAAGAAGC CGGCCGCGAT TATCTCGCCG GCTTCAAGGA AGGACTCGGC
GACCATGTGA ACCAGATCGT GTCTGAGGCG GTGTATGAAA CCACCGATCC GACCGTCGAC
TCCCAGATCG TCAAGTTCAA GGCCGCCGGC GCCGACGTGC TGTTCAACGA ATGCACGCCG
AAATTCGCCG CGCAGGTGAT CAAGAAGGCC GCCGAGCTCG GCTGGAAGCC GCAGATCATT
CTGCCCGCGG TTTCGAATTC GGTCGGCTCG GTACTCAAGC CGGCGGGGCT GGAGAATGCG
GTCGGCATCG TCACCGGCGC TTACGTGAAG GATCCGGGCG ATCCGCGCTG GGCCAATGAT
CCCGGCATGC AGCAATGGCA CGCCTGGATG AAGACCTACA ATGCGGGTGC CGATCCGGCC
GATATCTTCA ACGTCTACGG TTACACGATC ACGCAGATCA TGGAGCTGGT GCTGCGTCGC
GCCGGCGACG ATCTCACCAG GGCAAATCTG ATGAAGCAGA TCGAGTCGCT CGATGGCGTC
GAGCTGCCGA TGCTGCTGCC TGGCATCAAG CTGCAGATGT CGCCCGACCA GCGCACGCCG
ATCCGGCAGT TGCAGATGGC GCGCTTCAAC GGCACCTCCT GGGAGCTGTT CGGCGACGTA
CTGAGCGAGT AG
 
Protein sequence
MQLFRHTTIV AAAALLAVLS AGPALAGGSY DPGASDSLIK LGQTMPYSGP ASAYSTIGRA 
EAAYFKMLND KGGINGRKVE LLSLDDAYSP SKTVEQVRRL VESDEVLAMF SILGTGPNIA
VQKYLNIKKV PQLLPFSGAT RWNDAKHFPW TTGSQPTYKT EGRIYAKWIL ANKPNAKIAV
ITPAEEAGRD YLAGFKEGLG DHVNQIVSEA VYETTDPTVD SQIVKFKAAG ADVLFNECTP
KFAAQVIKKA AELGWKPQII LPAVSNSVGS VLKPAGLENA VGIVTGAYVK DPGDPRWAND
PGMQQWHAWM KTYNAGADPA DIFNVYGYTI TQIMELVLRR AGDDLTRANL MKQIESLDGV
ELPMLLPGIK LQMSPDQRTP IRQLQMARFN GTSWELFGDV LSE
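
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and computing its GC fraction. The following is a minimal sketch, not the method the source database uses. It assumes the sequence is pasted in with its space-separated 10-base blocks, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGCAATTGT TTCGGCATAC AACGATCGTC"
print(translate(first_block))  # MQLFRHTTIV
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein and the GC content field.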