Gene Rpal_4334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4334 
Symbol 
ID6412018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4660787 
End bp4661893 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content63% 
IMG OID642714216 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_001993305 
Protein GI192292700 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0248694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTCA AGCTTCTCGG TTTGGCATTC GGCGTCTCGC TGGCGCTCTC GACTACGGCG 
CTGGCGCAGG ACATCAAGGT CTCGGTCGCC GGTCCGATGA CCGGCGGCGA ATCCGCGTTT
GGCCGGCAGC TCAAGAACGG CGCTGAACAG GCGGTGGTCG ACCTCAACGC CAAGGGTGGC
CTGCTCGGCA AGAAGCTGGT GCTCGACGTC GAGGACGATG CCTGCGATCC GAAGCAGGCG
CGCTCGGTCG CCGAGAAGAT CGCGGGCGAC GGCATCCCGT TCGTCGCCGG TCACTTCTGC
TCGTCATCGT CGATCCCGGC GTCGGAAGCC TACGCCGACG GCAACGTGCT GCAGATCACG
CCGGCCTCGA CCAACCCGCT GTTCACCGAG CGCAAGCTGT GGAACGTGCT GCGCGTCTGC
GGCCGCGACG ATCAGCAGGG CCTGGTCGCC GCCGAGTACA TCCTTAAGAA CTACAAGGGC
AAGAACGTCG CCATCCTCAA CGACAAGACC ACTTACGGCA AGGGTCTGGC CGACGAGACC
AAGAAGGCGC TGAACAAGGC CGGCTTCCAG GAGAAGATGT TCGAGTCCTA CAACAAGGGC
GACAACGACT TTAACTCGAT CGTGTCGCGG CTGAAGCGCG ACGCCATCGA TCTGGTGTAC
ATCGGCGGTT ATCACCGCGA GGCCGGCCTG ATCCTGCGCC AGATGCGCGA CCAGGGCCTC
AGCACCGTGA TGATGGCTGG CGACGCGATG AACGACAAGG AATTCGCCTC GATCACCGGT
CCGCTGGCCG CAGGCACGCT GTTCACCTTC GGCCCCGACC CGCGCAACAA GCCGACCGCC
AAGCAGATCG TCGAAACCTT CAAGGGCAAG GGCATCGATC CGGAAGGCTA CACCCTCTAC
ACCTACGCGG CGTTCCAAGT GTGGTCGCAG GCGGTCGAGA AGGCGAAGTC GACCGACCCG
AAGAAGGTGA TCGAGACCAT CAAGGCCGGC GACTGGGACA CCGTGCTCGG CAAGATGGCG
TTCGACGCCA AGGGCGACAT CAAGGCGATC GACTACGTCG TCTACAAATG GGACGCCAAG
GGCGGCTACG CCGAGATCAA TCCTTAA
 
Protein sequence
MTLKLLGLAF GVSLALSTTA LAQDIKVSVA GPMTGGESAF GRQLKNGAEQ AVVDLNAKGG 
LLGKKLVLDV EDDACDPKQA RSVAEKIAGD GIPFVAGHFC SSSSIPASEA YADGNVLQIT
PASTNPLFTE RKLWNVLRVC GRDDQQGLVA AEYILKNYKG KNVAILNDKT TYGKGLADET
KKALNKAGFQ EKMFESYNKG DNDFNSIVSR LKRDAIDLVY IGGYHREAGL ILRQMRDQGL
STVMMAGDAM NDKEFASITG PLAAGTLFTF GPDPRNKPTA KQIVETFKGK GIDPEGYTLY
TYAAFQVWSQ AVEKAKSTDP KKVIETIKAG DWDTVLGKMA FDAKGDIKAI DYVVYKWDAK
GGYAEINP