Gene Rpal_1411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1411 
Symbol 
ID6409068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1489200 
End bp1490813 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content65% 
IMG OID642711310 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001990426 
Protein GI192289821 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.114372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCCATC CACCGCGTTG GATGCGCTCG GTCGTCGCGT CGAAAATCGC TGTGCCGGCG 
TTCGCGCTCG CGGCATCGTT GACGCTGCCG GCGGCCGTCG ATGCCAAGAC GATCCGCGCC
GTCATGCATT CCGACCTGCG TATCATCGAT CCCGGCCTGA CCACCGCCTA CATTACCCGC
GACCACGGCT ATATGGTGTA CGACACGCTG CTGGCGATGG ACTCCAAGTT CAAGGTCCAG
CCGCAGATGG CGGACTACAA AGTCTCAGAC GACAAGCTGA CCTACACGTT CACGTTGCGC
GACGGACTGA AGTGGCACGA CGGCACCCCG GTCACCGCGG AGGATTGCGT CGCCTCGCTG
AAACGCTGGG GCCAGAAGGA CGGCATGGGC CAGAAGCTGA TGCAGTTCAC CGCCAGCCTC
GAAGCCACCG ATCCCAAGAC CATCACGCTG AAGCTGAAGG AGCCCTACGC GCTGGTGCTG
GAATCGATCG GCAAGCCGTC GTCGCTGGTG CCGTTCATGA TGCCGAAGCG GATCGCCGAG
ACGCCGGCCG ACAAGCCGAT CCCAGAGCAG ATCGGCTCCG GCCCGTTCAA GTTCGTGGCC
TCGGAATTCC AGCCCGGCGT CAAGGCGGTG TACGTGAAGA ACCCCGACTA CATCCCGCGC
AAAGAGGCGC CGGACTGGAC CTCGGGCGGC AAGGTCGTGA AGGTCGACCG CGTCGAGTGG
ATCACCATGC CGGACGCGCA GACGGCGGTG AACGCCCTGC AGTCGGGTGA CATCGACTTC
ATCGAGAACC CGTCGTTCGA CTTGCTGCCG GTGCTGGCGC AGGACAAGGA GCTGACGATT
GACACGCTGA GCCCGCTCGG CTTCCAGACT CTCGGCCGGA TGAACTTCCT GCACCCGCCG
TTCGATAATC CCAAGGTTCG CCGCGCCGCC TTCCTGGCGA TGAGCCAGAA GCCGGTGCTC
GACGCGCTGG TCGGCAATCC GAAGTACTAC AAGATCTGTG GCGCCGTGTT CGGCTGCGGC
ACGCCGCTCG AGACCGACGT CGGCTCCGAG ACGCTGGTCA AGGGCAACGG CATGGCCGAG
GCCAAGAAGC TGCTCGCCGA ATCCGGCTAC GACGGCACGC CGATCGCGCT GATGGCGCCC
GGCGACGTGG TGACGCTGAA GGCGCAGCCG ATCGTCGCTG CTCAGTTGCT GCGTGACGCC
GGCTTCAAGG TCGACGTCCA GGCCACCGAC TGGCAGACCG TGGTGTCGCG CCGCGCCAGC
CAGAAGCCGC CGAGCGAAGG CGGCTGGAAT ATGTTCTTCA CCAACTGGGC CGGCCCCGAC
ATTCTCAATC CGGTCGCCAA CGTTTCGGTC GGTGGTCAGG GCAAGAAGGG CGGCTGGTTC
GGCTGGGCGG AGGACGCCAA GGTCGAGGAG CTGCGCGACA AGTTCGTCCG CGCCAACTCG
CCGGACGAGC AAAAGAAGAT CGCCGAAGAG ATCCAGAAGG AAGTCTATGA GCAGGTGATC
TACATTCCGC TCGGCCAGTA CACCGCGCCG AGCGTGTGGC GCAAGGAGCT CTCCGGCATC
GTTCACGGCC CGGCGACCCC GGTGTTCTGG AACATCGACA AGCAGGGCGA CTGA
 
Protein sequence
MFHPPRWMRS VVASKIAVPA FALAASLTLP AAVDAKTIRA VMHSDLRIID PGLTTAYITR 
DHGYMVYDTL LAMDSKFKVQ PQMADYKVSD DKLTYTFTLR DGLKWHDGTP VTAEDCVASL
KRWGQKDGMG QKLMQFTASL EATDPKTITL KLKEPYALVL ESIGKPSSLV PFMMPKRIAE
TPADKPIPEQ IGSGPFKFVA SEFQPGVKAV YVKNPDYIPR KEAPDWTSGG KVVKVDRVEW
ITMPDAQTAV NALQSGDIDF IENPSFDLLP VLAQDKELTI DTLSPLGFQT LGRMNFLHPP
FDNPKVRRAA FLAMSQKPVL DALVGNPKYY KICGAVFGCG TPLETDVGSE TLVKGNGMAE
AKKLLAESGY DGTPIALMAP GDVVTLKAQP IVAAQLLRDA GFKVDVQATD WQTVVSRRAS
QKPPSEGGWN MFFTNWAGPD ILNPVANVSV GGQGKKGGWF GWAEDAKVEE LRDKFVRANS
PDEQKKIAEE IQKEVYEQVI YIPLGQYTAP SVWRKELSGI VHGPATPVFW NIDKQGD