Gene Rpal_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4043 
Symbol 
ID6411726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4337739 
End bp4338728 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content69% 
IMG OID642713925 
ProductD-alanine--D-alanine ligase 
Protein accessionYP_001993014 
Protein GI192292409 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.860654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACATA CCATTCTGTT CGGCGGCACC AGCAAAGAGC GCCTGGTGTC GGTCGCCTCG 
GCGCAGGCGC TGCACAGTGC GCTGCCGGAC GCCGATCTGT GGTTCTGGAA CGACGACGAC
TGCGTCCACG CCGTGTCGGC GCAAGCGTTG CTGGCGCACG CCCGGCCGTT CGAAGAGGCG
TTCGTGCCGG ACGGCGAGGA CATCGGCCCG CTCGAGACCG CGCTCGACCG TGCCGCGGCC
GAGCAGCGGC TGCTGGTGCT CGGCCTGCAC GGCGGCGTCG CCGAGAACGG CGAGTTGCAG
GCAATGTGCG AATTGCGCCG GGTGCCGTTC ACCGGCTCTG GCTCGGCGGC TTCGCATTTA
GCCTTCGACA AGGTCGCCGC CAAGCGCTTC GCCTCGATCG CCGGTGTCCG GACCGTGACC
GGGATCGCGC TGGCCGATGC TGAGGCGGCG CTCGAAGCTC ACGGCCGGCT GATCGCCAAG
CCGGCCTGCG ACGGCTCCAG CTATGGGCTG TTCTTCATCA ACGCCAAGCA GGACCTGGTG
GCGGTGCGCC ATGCCGCCAA GACCGAGGAC TATCTGATCG AGCCGTTCGT CGCCGGCGTC
GAGGCGACCT GCGGCGTGCT CGAAGCCGAG GACGGCTCGC TGCGCGCGCT GCCGCCGATC
GAGATCGTGC CGGCCGACGG CGGCTTCGAC TACACGGCGA AATATCTCGC CAAATCGACC
CAGGAGATCT GCCCCGGCCG GTTCGCCCCC GAGATCGCGT CGGCCATCAT GGACTACGCC
GTGCGGGCGC ACCGGGTGAT GTCGTGCCGG GGCTATTCGC GCTCCGACTT TATCGTCGGC
AAGGACGGTC CGATCTTCCT GGAGACCAAC ACGCTGCCCG GCCTGACCAA GGCCTCGCTC
TATCCCAAGG CGTTGAACGC GCTGGGAATC CCCTTCGTCG ACTTCCTCCG CGGCCAGATC
GCGCTGGCCG AGCGCGCCGT GCGGAGCTGA
 
Protein sequence
MRHTILFGGT SKERLVSVAS AQALHSALPD ADLWFWNDDD CVHAVSAQAL LAHARPFEEA 
FVPDGEDIGP LETALDRAAA EQRLLVLGLH GGVAENGELQ AMCELRRVPF TGSGSAASHL
AFDKVAAKRF ASIAGVRTVT GIALADAEAA LEAHGRLIAK PACDGSSYGL FFINAKQDLV
AVRHAAKTED YLIEPFVAGV EATCGVLEAE DGSLRALPPI EIVPADGGFD YTAKYLAKST
QEICPGRFAP EIASAIMDYA VRAHRVMSCR GYSRSDFIVG KDGPIFLETN TLPGLTKASL
YPKALNALGI PFVDFLRGQI ALAERAVRS