Gene Rpal_3961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3961 
Symbol 
ID6411642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4251267 
End bp4252331 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content64% 
IMG OID642713842 
ProductABC transporter related 
Protein accessionYP_001992932 
Protein GI192292327 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.565548 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCGG TGCAGATTCA CGACGTGCGT AAGTCTTTCG GCGGCTTCGA AGTATTGCAC 
GGCGTGACTG TTCCCATCGA GGATGGTGAG TTCGTCGTTC TGGTCGGCCC GTCCGGCTGC
GGCAAATCCA CTTTGCTGCG GATGCTCGCA GGGCTGGAGA AGATCACTGC CGGAACGATC
TCGATCGGTG AGCGCGTCGT CAACGACGTG CAGCCGAAGG AGCGGGACAT CGCGATGGTG
TTCCAGAACT ACGCGCTGTA TCCGCACATG ACCGTCGCCC AGAACATGGG CTTCTCGCTG
AAGCTGCGCG GCGCCGACCA GAAGGCCATC GACAGCAAAG TGCAGCGGGC GGCCGACATC
CTCGATCTCG GCAAGCTGCT CGACCGCTAT CCGCGCCAGC TCTCCGGCGG CCAGCGCCAG
CGCGTTGCGA TGGGGCGGGC GATCGTGCGC GATCCGCAGG TGTTCCTGTT CGACGAGCCG
CTGTCGAACC TCGACGCCAA GCTGCGGGTG GCGATGCGCA CAGAGATCAA GGAGCTGCAT
CAGCGGCTGA AGACCACCAC GGTTTACGTC ACCCACGATC AGATCGAGGC GATGACGATG
GCCGACAAGA TTGTGGTGAT GCAGGACGGT ATCGTCGAGC AGATCGGCGC GCCGCTCGAC
CTGTACGACC GGCCCGACAA TAAGTTCGTG GCCGGCTTCA TCGGTTCGCC GGCGATGAAC
TTCCTCGACG GCACGCTGAA AGTGAATGGC GGCCAGCCAT ATGTCGAGAC CGCCAGCGGC
GCCAAGCTGC CGATCGCCGC GGCGCCTGCG AACGGCAATG GCCGCCCGGT GTCCTACGGC
ATTCGTCCCG AGCATCTCGA CTTTGCAGAT AGTGGCATCC CGGCCGAGGT CGCGGTGGTC
GAACCGACCG GCTCGGAAAC CCAGATCGTG GCCCGGGTCG GAAATCAGGA AGTGATCGCG
GTTTTTCGCG AGCGGCATCC GGTCGGGCCC GGCGATCTGA TCCATCTGCA GCCGCGCGCC
GACGTCGCGC ATCTGTTCGA CAAGGAGAGC GGCCGGCGGA TCTAG
 
Protein sequence
MASVQIHDVR KSFGGFEVLH GVTVPIEDGE FVVLVGPSGC GKSTLLRMLA GLEKITAGTI 
SIGERVVNDV QPKERDIAMV FQNYALYPHM TVAQNMGFSL KLRGADQKAI DSKVQRAADI
LDLGKLLDRY PRQLSGGQRQ RVAMGRAIVR DPQVFLFDEP LSNLDAKLRV AMRTEIKELH
QRLKTTTVYV THDQIEAMTM ADKIVVMQDG IVEQIGAPLD LYDRPDNKFV AGFIGSPAMN
FLDGTLKVNG GQPYVETASG AKLPIAAAPA NGNGRPVSYG IRPEHLDFAD SGIPAEVAVV
EPTGSETQIV ARVGNQEVIA VFRERHPVGP GDLIHLQPRA DVAHLFDKES GRRI