Gene Rcas_4199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4199 
Symbol 
ID5541710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5433872 
End bp5435227 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content62% 
IMG OID640896308 
Productadenylosuccinate lyase 
Protein accessionYP_001434246 
Protein GI156744117 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.225542 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.131988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTTA CCGCTCTTTC ACCCCTCGAC GGGCGCTATC GCGCCGATGT CGCCGATCTC 
GAAGCCTATT TCAGCGAGGC GGCGTTGTTT CGCTATCGGC TGCGTGTCGA GGTGGAGTAT
CTGCTGTTTC TGACACGCGC GCGGGACGTC GATTTCGTGC CGAAACTCGA TCCGCGCGCT
GCGGCGGCGC TGCGCAATCT CTACCGTTCA TTCTCCGCCG ACGATGCCGA AGCGATTGCC
GCATGGGATC GCAAGGTCAA CCACGATGTC AAGGCGGTCG AATACTGGCT GCGCGAACGG
CTCGATGCGC TTGGTCTCGG CGCCTGGAAG GAGGCGGTAC ACTTCGCGCT GACATCAGAA
GACATCAACA ATCTGGCATA TGCGCTCATG ATTCGCGAGG CGCGCGATCT GACGCTAATG
CCCGCTGTCG GCGAGATTAT CGTGGCGCTG CGCGACCTGG CGTTTGCAGA AGCCGAAACG
CCAATGCTGG CGCGCACCCA TGGGCAGCCT GCCACGCCAA CGACCTTCGG CAAGGAGATG
GCAGTCTTCG CTGCGCGCCT CCAACGCGCG CACGAACAGG CGTCCGGCAT CCGACTCACC
GGTAAACTGA ACGGCGCCAC CGGTACATTC GCAGCGCACG TTGCAGCGTT GCCACAGGTA
GACTGGATCG CGTTCAGTCG CGGCTTCATC CGCTTCCTCG ACCTTGAGCC AGTGCTGTTG
ACAACCCAGA TCGAACCTCA CGATACGCTC GCCGAACTGT GCGACGCGCT GCGGCGCCTC
AACACGGTGG TGATCGATCT CTGCCAGGAT ATGTGGCGCT ATATCAGCGA CGGCTCCCTG
GTGCAGATCG CGCGCCCCGG CGAGGTCGGG TCCTCCACAA TGCCGCATAA GGTCAATCCT
ATCGACTTCG AAAACGCCGA AGGCAACCTG GGGCTGGCAA ATGCGCTTTT GGAGCACTTC
AGCCGTAAGT TGCCGGTTTC GCGCCTGCAA CGCGACCTGA CCGACAGTAC GGTGCTGCGC
AATCTCGGCG TTGCATTCGG ACATTGCCTG CTTGGATATC GGCGCGTCAG CAAGGGATTG
GGGAAGATCG CCGTTGACCG GGAACGCCTG CTGGACGACC TGCGGCATCA CCCGGAAGTG
CTGGCGGAAG CGGTGCAGAC TATCCTGCGC CGCGCCGGTT ACCCGGAGCC GTATGAGGCG
CTCAAGCGCC TGACGCGCGG GCGCGCGCTG ACGATGGAGA TGCTGCACGC ATTCATTGAT
GAACTTGATG TTTCACCGAC GGTGAAGGAG GAACTGCGCG CCCTCACGCC GGAAACCTAC
ACCGGGCTGG CTGATCGCCT GGCGCGCATG GTGTGA
 
Protein sequence
MSLTALSPLD GRYRADVADL EAYFSEAALF RYRLRVEVEY LLFLTRARDV DFVPKLDPRA 
AAALRNLYRS FSADDAEAIA AWDRKVNHDV KAVEYWLRER LDALGLGAWK EAVHFALTSE
DINNLAYALM IREARDLTLM PAVGEIIVAL RDLAFAEAET PMLARTHGQP ATPTTFGKEM
AVFAARLQRA HEQASGIRLT GKLNGATGTF AAHVAALPQV DWIAFSRGFI RFLDLEPVLL
TTQIEPHDTL AELCDALRRL NTVVIDLCQD MWRYISDGSL VQIARPGEVG SSTMPHKVNP
IDFENAEGNL GLANALLEHF SRKLPVSRLQ RDLTDSTVLR NLGVAFGHCL LGYRRVSKGL
GKIAVDRERL LDDLRHHPEV LAEAVQTILR RAGYPEPYEA LKRLTRGRAL TMEMLHAFID
ELDVSPTVKE ELRALTPETY TGLADRLARM V