Gene Rpal_2047 details

Gene Information

Locus tag: Rpal_2047
Symbol:
ID: 6409707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2216903
End bp: 2218513
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 65%
IMG OID: 642711933
Product: protein of unknown function DUF853 NPT hydrolase putative
Protein accession: YP_001991045
Protein GI: 192290440
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
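
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2216903
END_BP = 2218513
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2218513 - 2216903 + 1 gives 1611 bp, and 1611 / 3 - 1 gives 536 aa, matching the listed values.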


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.171196
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGCGA GCGGCATCGC GAGCGGTGAT ACCGAAGACA AGATTTTTAT CGGTAAAGGC 
GAACAGCCGG CGTGGTTGAC GTTGGCGCTC GCCAATCGTC ACGGCCTGGT CACCGGCGCC
ACCGGTACAG GCAAAACAGT GTCGCTGCAG GTGATGGCGG AAGGCTTTGC GCGCGCCGGC
GTACCGGTAT TTGCAGCCGA CATCAAAGGC GATCTATCCG GTATCGCCGA GGTCGGCGAG
GCCAAGGACT TCATCCTCAA GCGCGCCCAG GAGATCGGGT TGAAATTCCA GCCTGATCAG
TTCACCACCG TGTTCTGGGA CGTGTTCGGC GAGCAGGGCC ATCCGGTGCG CGCCACCGTT
TCCGAGATGG GACCGCTTTT GCTGTCGCGG ATGCTCGATC TCAACGACGT GCAGGAGGGC
GTACTCAACG TCGCATTCCG TGTCGCCGAC GACATGGGCC TGCCGCTGAT GGACATGAAG
GATCTGCGTG CGATGCTGGA TGCGATCGCG CCGATCGCCC AGAAGGTCGC GGACAACGGC
GACGTCAACG CCGACATTCG CCAGGCTGCG CAGTCGCTCG GCAACGTCAC CAAGCAGACC
GTCGGCACCA TCCAACGTCA GCTGCTGGTC CTGGAAAATC AGGGCGGCGA GAAGTTCTTC
GGCGAGCCTG CGCTGCAGCT CAAAGACTTC ATCCGCACCG ACAGCCAGGG CCGCGGCCTC
GTCAATATTC TGGTCGCCGA CAAGCTGATG ACCAATCCGC GTCTGTACGC GACCTTCCTG
CTGTGGATGC TGTCGGAACT GTTCGAGGAG CTGCCGGAAG TCGGCGATCC CGACAAGCCG
AAGCTGGTGT TTTTCTTCGA CGAGGCGCAC CTGCTGTTCA ACGACGCGCC GAAGCCGTTG
ATGGACAAAA TTGAACAGGT GGTGCGGCTG ATCCGCTCCA AGGGCGTCGG CGTGTACTTC
ATCACCCAGA ACCCGATCGA CGTGCCCGAC CGGGTGCTGG CGCAGCTCGG CAATCGGGTG
CAGCACGCGC TCCGTGCTTT CACGCCGCGC GACCAGAAGG CGGTGGCGGC CGCGGCTGAC
ACCTTCCGGC CGAACCCGAG GCTCGATACG GCCAAGGCGA TCACCGAGCT CGGCAAGGGC
GAAGCGCTGG TGTCGTTCCT CGAAGGCAAC GGCACGCCGG CGATGGTCGA GCGCGTGCTG
GTGCGGCCGC CGTCGGCGCG GATCGGGCCG ATCACGCCGG AGGAGCGCAA GGCGATCATC
GCCGCGAGCC CGGTGAGGGG CAAATACGAC ACCGCGGTGG ATTCCGAATC CGCCTATGAG
AAGCTGCGCG CCCGGATCGA CGGCAAGTCG GCGAGTGAAG GCCCTGCACC GGGCGAGGGC
GGCATTCTCG GTCAGCTCGG CGGGCTGTTC TCGACCGTGT TCGGCACCAA CACGCCGCGC
GGCAAGCTCA CCACCGGTCA ACTCGTCGCC CGCAATGTCG CCCGCACCGT CGCAACCACG
GTGGTCGGCG GCGTCGCCGC CGAGCTCGGC AAGAAGGTCG GCGGATCGCT CGGCAGCTCG
GTCGGTCGCT CGATCGTGCG TGGCACGCTG GGCGGAATGC TGCGACGGTA A
 
Protein sequence
MAASGIASGD TEDKIFIGKG EQPAWLTLAL ANRHGLVTGA TGTGKTVSLQ VMAEGFARAG 
VPVFAADIKG DLSGIAEVGE AKDFILKRAQ EIGLKFQPDQ FTTVFWDVFG EQGHPVRATV
SEMGPLLLSR MLDLNDVQEG VLNVAFRVAD DMGLPLMDMK DLRAMLDAIA PIAQKVADNG
DVNADIRQAA QSLGNVTKQT VGTIQRQLLV LENQGGEKFF GEPALQLKDF IRTDSQGRGL
VNILVADKLM TNPRLYATFL LWMLSELFEE LPEVGDPDKP KLVFFFDEAH LLFNDAPKPL
MDKIEQVVRL IRSKGVGVYF ITQNPIDVPD RVLAQLGNRV QHALRAFTPR DQKAVAAAAD
TFRPNPRLDT AKAITELGKG EALVSFLEGN GTPAMVERVL VRPPSARIGP ITPEERKAII
AASPVRGKYD TAVDSESAYE KLRARIDGKS ASEGPAPGEG GILGQLGGLF STVFGTNTPR
GKLTTGQLVA RNVARTVATT VVGGVAAELG KKVGGSLGSS VGRSIVRGTL GGMLRR
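
The sequences above are printed in space-separated groups of 10 characters. The following is a minimal Python sketch of how such a block might be cleaned, checked for GC content, and translated. It is an illustration only, not the tool that produced this record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for internal codons; table 11 also allows alternative start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order.
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First row of the gene sequence from this record.
    first_row = clean(
        "ATGGCGGCGA GCGGCATCGC GAGCGGTGAT ACCGAAGACA AGATTTTTAT CGGTAAAGGC"
    )
    print(translate(first_row))
    print(round(gc_fraction(first_row), 2))
```

Translating the first row of the gene sequence yields `MAASGIASGDTEDKIFIGKG`, which matches the first two groups of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` would allow the listed GC content to be checked in the same way.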