Gene Rpal_2356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2356 
Symbol 
ID6410018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2544333 
End bp2545331 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content68% 
IMG OID642712236 
ProductApbE family lipoprotein 
Protein accessionYP_001991346 
Protein GI192290741 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCAAT CTACTCTCAC CCGACGGCGT TTCGTCACGA TCGTTGCCAG TGCGTTTGGC 
GTCGCGATGC TCGGCCGCGT GGTGCCGTCG CGCGCGAGCG AGCCGGTGCG CTGGCGCGGC
GCGGCGCTCG GCGCGCAGGT GTCGATCGAG ATCCACCATC CTGATCGCGT CGCCGCCGAG
AGGCTGGTGG AGAGGGGCGT GCGCGAGGTA CGCCGGCTCG AGCAGATGTT CAGCCTGTAT
CGGCCGGACT CGGCGATCTG CGCGCTCAAC CGGTCAGGTG TGCTGATCGC GCCGGACCGC
GATGTGGTGG CGCTACTGCA GACGACGTTA GACTTCGCCG CGCAAACCGG CGGCGTATTC
GATCCCACCG TGCAGCCTCT GTGGCAGCTG TATCGACGCC ACTTCGAGCA GGCCGGGGCG
GATCCGTCCG GTCCAGCGAA GGCAGACGTC GCAGGTGCGC TGGCGAAGGT CGGATACGAT
GGCGTGCTGG TGTCAGCCGA TCGGATCGCG CTGAAACGAC CGGGCGCCGC GATCACGCTC
AATGGTATCG CCCAGGGCTT CGCCACCGAT CGCGTGGTCG ATCTGCTGCG CAAGGGCGGG
ATGACCAGCA CCCTGGTCGA CATCGGTGAG ATCCGTGCGA TCGGTGCCCG GCCGGACGGC
GTGCCGTGGC GGGTCGGACT GGCCGATCCG GAAACGACGA GTGCCAACCT CGGCACCGTC
GATCTGGTTG ACCGCGCGGT TGCGACGTCC TCTGGCGCCG GCTTCCGGTT CGATCCGGCC
GGGCAGTTCA CGCATCTGTT CGATCCCTCG ACCGGACGCA GCCCGGCGCT GTATCGTTCG
GTCAGTGTCG TCGCGCCCAC CGCGACCGAG GCCGATGCGC TGTCGACCGC GTTCAGCGTG
CTGGAGCGTG GCCGCATCGA TGCGATCGTT CAGGCGAGGG CAGGCGTCGA GGTGTTGCTC
GCCGATGCTG AGGGAGGCGT GCAGTGGCTG CGCGGGTAG
 
Protein sequence
MSQSTLTRRR FVTIVASAFG VAMLGRVVPS RASEPVRWRG AALGAQVSIE IHHPDRVAAE 
RLVERGVREV RRLEQMFSLY RPDSAICALN RSGVLIAPDR DVVALLQTTL DFAAQTGGVF
DPTVQPLWQL YRRHFEQAGA DPSGPAKADV AGALAKVGYD GVLVSADRIA LKRPGAAITL
NGIAQGFATD RVVDLLRKGG MTSTLVDIGE IRAIGARPDG VPWRVGLADP ETTSANLGTV
DLVDRAVATS SGAGFRFDPA GQFTHLFDPS TGRSPALYRS VSVVAPTATE ADALSTAFSV
LERGRIDAIV QARAGVEVLL ADAEGGVQWL RG