Gene Rpal_1752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1752 
Symbol 
ID6409409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1880342 
End bp1881658 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content62% 
IMG OID642711640 
Productputative UreA/short-chain amide transport system substrate-binding protein 
Protein accessionYP_001990755 
Protein GI192290150 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.225612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTTTG ACCGTCGGCA GCTTTTGCTC GGAGGAGTCG GCGCCGCAGC CGGGCTCGCG 
CTTCCTGGAA GTGCATTTGC GCAATCGCCG GCGGCGATCG GAACCTTTCC GGCAGGTGTT
TCGGCCGACT CGGTGTTCGT AGGCCTGACC ATTCCGCTCA CCGGGGTTTT CTCGGGCGAT
GGCGGCGACC TCAAGCTCGG CTATGAGCTG GCGATCGCGC AGATCAACGC CGGTAGCGAG
ATCGCGCAGC AATGGGGACT CAAGGGCAAG GGAGTGCTGG GCCGGCAAAT TCGCCACAAG
GTCTCGGACA CCGAGGGCAA GCCGAACCTC GATGTGCAGA GCGCCACGCA GTTCATCCAG
CGCGACAAGG CCATCATGGT GTCGGGCTCG GTGTCGAGCT CCAGCGCGAT CGCGCTCGAA
GAGCTCGGGT CGCGCGAGAA AGTGCTGTAC ATGGTCGGCA TCGCCGGCTC CAACGACATC
ACCGGAAAGA ATTGCCAGCG CTACGGATTC CGCTCTCAGC AGAACGCCTA TATGGCGGCC
AAGGGCCTCG CTCCGGTGGT GGCGAAGGCG CTCGGCAAGA ACGTCAAGAT GGCCTTCCTG
GTGCCCGACT ACACCTACGG CCACAGCGTG TATGACAGCT TTTCCAAGTT CGCGACCGAG
CAGGGCTGGA AGCAGGTTGC CAAGGAAGTG GTGCCGCTCG GGACCACCGA TTACTCCTCG
GCGTTGCTCA ATATCGCCAA CAGCGGCGCC GATGTGTTCG TCAACATCGC CTTCGGTGCC
GACTCCGTCG CCTCGACTAA GCAGGCCGAG CAGTTCGGTG TGCTGAAGCG GATGAAGCTC
GTCGTGCCCA ATCTGTCGTC GTTCCAGGAC AAGGAGCTCG GCGCCGAGTT GATGCAGGGG
GTCTACGGAA GCTGTGATTT CTGGTTCGGT CTGCAGGACA AGTTCCCGCT CGCCAAGGCG
TTCGTCGACA GCTTCGTCGC GCAGAACAAT TACCATCCGC GCTGGGGTGC CCATATCGGC
TACATGCAGA CCTATCTGTG GGCCATGTCG GTCGAGCGCG CCAACACCTT CAATCCGGTG
GACGTGATCA AGGTGATGGA GAATTCCAAG GCGCAGCCAT ACGTCACGAC GATCGGCAAA
GTGTATTTCC GCGCCGAGGA CCATCAGATG GTGCGCCCGA TCCCGATTCT GCGCGGCAAG
AAGCCGGCGG AGATGAAGCA CAAGGAAGAC TTCTACGACA TCATCGACCT CGTCGACGGC
GAGGCCGTGA TGAATCCGCC GGACCTGTTC GGTTGCAAGC TCGGCCCCTA CACCTGA
 
Protein sequence
MQFDRRQLLL GGVGAAAGLA LPGSAFAQSP AAIGTFPAGV SADSVFVGLT IPLTGVFSGD 
GGDLKLGYEL AIAQINAGSE IAQQWGLKGK GVLGRQIRHK VSDTEGKPNL DVQSATQFIQ
RDKAIMVSGS VSSSSAIALE ELGSREKVLY MVGIAGSNDI TGKNCQRYGF RSQQNAYMAA
KGLAPVVAKA LGKNVKMAFL VPDYTYGHSV YDSFSKFATE QGWKQVAKEV VPLGTTDYSS
ALLNIANSGA DVFVNIAFGA DSVASTKQAE QFGVLKRMKL VVPNLSSFQD KELGAELMQG
VYGSCDFWFG LQDKFPLAKA FVDSFVAQNN YHPRWGAHIG YMQTYLWAMS VERANTFNPV
DVIKVMENSK AQPYVTTIGK VYFRAEDHQM VRPIPILRGK KPAEMKHKED FYDIIDLVDG
EAVMNPPDLF GCKLGPYT