Gene Rcas_1090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1090 
Symbol 
ID5538556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1412674 
End bp1413789 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content62% 
IMG OID640893224 
ProductD-alanine--D-alanine ligase 
Protein accessionYP_001431207 
Protein GI156741078 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCA AAGTGCGAGT GGGAGTCCTG TTCGGCGGGC AATCGGGCGA ACACGAGGTG 
TCGCTGGTCT CAGCGCGCGC GGTCATGGAT GGTCTCGACC CTGAACGATA TGATGTGACA
CCAATCGGCA TCACGAAGCA CGGGCGCTGG ATCATTGCCC CCGACGCGCA TACCCGTCTG
TTGGCGCAGG CCGATCCGGC AAAGTTGCCG GGAGGATCGT CTGCCGGTGC TGCTGATGCA
GCGGATGGCG TTGACGATGT GGAGACCACC GATCCGCTGA CCCTGATGGT TGCCGGTGAA
CAGTCACCCC TCAAAACGCT CGATGTCGTT TTTCCGGTCC TCCATGGACC GATGGGGGAA
GATGGAACGG TGCAGGGGTT GCTGGAATTG ATGGAAATCC CGTATGTTGG CTGCGGCGTC
ATGGCATCGG CAGTCGGCAT GGACAAAGCG ATGACGAAGG TTGCCTTTTC CGGCGCGGGG
TTGCCGGTGT TGCCCTGGCT GCTCATTCGA CGGCGCGAAT GGGAACGCGA ACCGGCACAC
GTGCTCGACT GGGTCGAACA GCGCCTCACC TACCCGATGT TCGTCAAGCC GGCAAACCTG
GGGTCGAGCG TCGGGATCAG TAAGGTGACC GGTCGCGCGG CGCTTGCGCG CGGCATCGCC
GAAGCTGCGG CGTATGATCG GCGGATCGTC GTTGAACAGG GTATTCCCGC GCGCGAGATC
GAAGTCAGTA TCCTGGGTAA CAATGAACCG GAAGCCAGCG TTCCCGGCGA AGTGGTTCCT
TCCGGCGAAT GGTACGACTA TGAAGCAAAA TACCTGAGCG GCGCATCGAA GATACTCATT
CCTGCGCCGA TCACGCTCGA ACTCGCGGCA CGGGTGCGGC GCCTGGCAGT GCAGGCGTTC
AAAGCCATCG ATGGCGCCGG GCTGGCGCGC GTCGATTTTC TGCTCAACCG CGAGACTGGC
GACCTGTATC TGAACGAGAT CAATACCATG CCCGGCTTTA CCCCGGTGAG CATGTACGCC
ATGATGTGGG AAGCCAGCGG TCTCCCCTAC ACCCAACTGC TCGACCGCCT GATCGAACTG
GCGCTCGAAC GGCGCGGCAG AGGGCGATAT GCGTAA
 
Protein sequence
MTRKVRVGVL FGGQSGEHEV SLVSARAVMD GLDPERYDVT PIGITKHGRW IIAPDAHTRL 
LAQADPAKLP GGSSAGAADA ADGVDDVETT DPLTLMVAGE QSPLKTLDVV FPVLHGPMGE
DGTVQGLLEL MEIPYVGCGV MASAVGMDKA MTKVAFSGAG LPVLPWLLIR RREWEREPAH
VLDWVEQRLT YPMFVKPANL GSSVGISKVT GRAALARGIA EAAAYDRRIV VEQGIPAREI
EVSILGNNEP EASVPGEVVP SGEWYDYEAK YLSGASKILI PAPITLELAA RVRRLAVQAF
KAIDGAGLAR VDFLLNRETG DLYLNEINTM PGFTPVSMYA MMWEASGLPY TQLLDRLIEL
ALERRGRGRY A