Gene Rcas_4219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4219 
Symbol 
ID5541730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5458140 
End bp5459615 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content65% 
IMG OID640896326 
ProductO-succinylbenzoate-CoA ligase 
Protein accessionYP_001434264 
Protein GI156744135 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR01923] O-succinylbenzoate-CoA ligase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.121755 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCGCG ATTGGCTCTC GGCGCAGGCG CAGGCGCGTC CCGAAGGTGC GGCGCTGATC 
ATCGGCGACA CAACACTGAC GTACCGCGCT CTGCACGAGC AAACGGCGAC GTTCGCTTCC
CGCCTCGCTG CGGCTGGCGT CGAGCAAGGC GCGGTTGTCG GCGTGCTGTT GTCGAATCGT
CTCGAAGCGG CGCTGGCGGT GCATGCCGCG CCGCGCCTCG GCGTGACGCT GGCGCTGTTC
AACACCCGCC TGACCCCTGC CGAACTCGAT GCGCAGGTGC GCGCAGCAGT GTGTCGCATC
CTCGTGTGTG AGCGCGACAC GCTGTTGGCA GCGCTGGCGC TTCCTTCGGC GCCCCATGTG
CTGTGCGTCG ATCCGGTCGA CGACCCACGC CTGACGCCGG TTGACCGGAT TTCCGGGGAT
AGCGCCGCAT ACTGCGAAGG CGCCATCGAC CTCGATGCGC CGTTTGTGAT GATGTTCACT
TCGGGGACAA CGGGGACGCC ACGGGGAGTA GTGCTGACCT ACGGCGCATT CTTCGCCAGC
GCGATGGCGT CGGCATACCG CATCGGCGTT CTGCCGGGCG ACCGCTGGCT CTGTGTGTTG
CCACTCTATC ACATTGGCGG TCTCAGCATT CTGCTGCGGT CCTGCCTCTA CGGCACGGCA
GTGGATCTCT GGCAACGTTT CGACGCTCCG GCAATCACAG AACGTTTGAA GGCGACACCG
ATCACACTCA TTTCGCTGGT GCCGACGATG CTCTACCGCC TGCTCGATGA CGCTGGCGAT
GCGCCACCGA ACCTGCGGCT CGTGCTGCTT GGCGGAGCTG CTGCGCCAAC CGATCTGCTG
GAGCGCGCAC TGGAAGCAGG ATGGCCCATT GCCACAACCT ACGGGCTGAC CGAGGCAGCG
TCGCAGGTGG CGACGGCGCT GCCCGACGAG GTACGGCGCA AGCCCGGCAG CGTCGGGCGA
CCGCTGATCT TCACCCACGT GCGTGTGACG AACGAACAGG GACGCGACCA ACCACCCGGC
GTCTACGGCA ACATCCTGGT GCGGGGTCCG ACCCTGATGC GCGGATACCT CGGCGAAACG
CCGCTCGACG CCGACGCCTG GTTTGCCACC GGAGACATCG GCTATCTCGA CGCCGACGGC
GACTTGTGGG TAGTGCAGCG ACGCAGCGAC CTGATTATCA GCGGCGGGGA GAATATCTAT
CCGGCGGAAG TCGAACAGGC GCTGCGCCAG CACCCCGCAG TCGCCGATGT TGCAGTCGTT
GGCGTGCCAT CAGCGGAGTG GGGGCAGCAG GTCGGCGCTG CCATCGTCCT GCGCGACCCA
TCGGTGAGCG TCGAAGCAAT CCTGGCGTTC AGCCGCACTC GTCTGGCGGG ATACAAACAA
CCGCGCGTCG TTCGCATCGT CGCTGAGTTG CCGCGCACCG CATCGGGAAA GATTCAGCGG
GAAGCGGTGA TCAATCTGTT GAAGGTTGCA GGTTAA
 
Protein sequence
MMRDWLSAQA QARPEGAALI IGDTTLTYRA LHEQTATFAS RLAAAGVEQG AVVGVLLSNR 
LEAALAVHAA PRLGVTLALF NTRLTPAELD AQVRAAVCRI LVCERDTLLA ALALPSAPHV
LCVDPVDDPR LTPVDRISGD SAAYCEGAID LDAPFVMMFT SGTTGTPRGV VLTYGAFFAS
AMASAYRIGV LPGDRWLCVL PLYHIGGLSI LLRSCLYGTA VDLWQRFDAP AITERLKATP
ITLISLVPTM LYRLLDDAGD APPNLRLVLL GGAAAPTDLL ERALEAGWPI ATTYGLTEAA
SQVATALPDE VRRKPGSVGR PLIFTHVRVT NEQGRDQPPG VYGNILVRGP TLMRGYLGET
PLDADAWFAT GDIGYLDADG DLWVVQRRSD LIISGGENIY PAEVEQALRQ HPAVADVAVV
GVPSAEWGQQ VGAAIVLRDP SVSVEAILAF SRTRLAGYKQ PRVVRIVAEL PRTASGKIQR
EAVINLLKVA G