Gene Rcas_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1968 
Symbol 
ID5539446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2518158 
End bp2519744 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content63% 
IMG OID640894103 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_001432074 
Protein GI156741945 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.302268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCGT ATTTTGCTCA TTCCGCTATC GATCCCGTCG ATCTTGAAGT GTTTCGCAAC 
CGGTGCGCGG CAATTGCCGA AGAGATGGGC GCGGCGCTTG GGCGGAGCGC GCTTTCGGCG
AATATCAAGG AGCGACGCGA TTTCTCCTGT GCGGTCTTCG ATGCACAGGG GCGGATGGTT
GCGCAGGCGG CGCATATCCC GGTGCATCTG GGCGCTATGC CGCGTTCAGT CGAGGCGGCG
CTGGCGCAGT TCACCCTTGC GCCGGGTGAT GTCGTTATTC TCAATGATCC GTACCTTGGC
GGCACCCATC TCCCCGACTT GACCACTGTC GCTCCAGTGT ATGCCGGTGC GACGCTGATC
GGCTATACGG CAACGCGTGC CCATCACGCC GATGTTGGCG GTATGACCCC CGGTTCGATG
CCGATGAGTC GTGAGGTCTA TCAGGAAGGA TTGATCATCC CGCCAACCCT TCTCGCGCGC
GGCGGCATCC CCGATGAGGC GGTGATCGCG TTGATTTGCC GTAATTCACG TACCCCCGAT
GAGCGGCGCG GCGATCTGGC TGCCCAGATG GCCTGCCATC GCGTCGGTGC GCAACGTCTG
GCGGAACTGG CAGAACGGCA TGGTCCGGCA TGGGTGGCGC GCCATATGGA GGCGCTCCTG
GCGTATGGCG AACGCCATAT GCGCGCAGTG ATCGCCGCGA TCCCAGACGG AACATATACA
TTTGAGGACG CGCTGGATAA CGATGGTGTT GATGCCGATC CGCTGACGAT CTGTGTGCGC
ATCGACATCC ATGGCGAATG CGCTGTCGTA GACTTTGAAG GCACATCGTC ACAGTGCCGT
GGTCCACTCA ATGCGCCCCG CGCTGTGACC GAATCGGCGG TTCTCTACTG TTTTCGCTGT
CTCGGTTCGC CAGATATGCC GTCGTCGGCT GGCGCGTTCG CGCCACTCGA CATTCGTGTT
CCGGCGGGAA GCATACTGGC GCCTCACTCT CCGGCTGCGG TTGCGGGTGG GAATGTCGAA
ACGGCGCAGC GAGTCGTTGA TGTGGTGTTT GGTGCGCTGG CGCAGGCGCT CCCCGACCGA
ATTCCCGCGG CGTCGGCAGG TACGATGAAT AACTGGACAT TTGGCGGCAT GGCGTCCAAT
GGTGTGCCCT TTGCCTACTA TGAAACGCTT GGCGGCGGGA TGGGCGCCCG CCCGACGCTG
CCGGGTCTCA GCGGGGTGCA GGTGCATATG ACCAATACGC TCAATACGCC GGTCGAGGCG
CTGGAGCGTC AGTTTCCGTT GATTGTGCGG CGCTACGGGT TACGCCGCGG CTCTGGCGGC
GTGGGGCGCG TGCGCGGTGG CGATGGTCTG GTGCGCGAGG TTGAGTTTCG AGCGCCGGTC
ACGGTCAGCC TGTTGACCGA ACGCCGGGTG TATGCGCCCT ACGGATTGTA CGGCGGCGCT
CCTGGATTGC GCGGGCGCAA TATTCTGCTG CGCGATGGCG AAGAACGCAT CCTGCCCGGC
AAGATTACTA TCGACTTGCG TCCCGGCGAC ATTCTGCGCA TCGAAACGCC TGGCGGCGGC
GGCTTCGGCG CTCAGGAGCA GGGATAG
 
Protein sequence
MSSYFAHSAI DPVDLEVFRN RCAAIAEEMG AALGRSALSA NIKERRDFSC AVFDAQGRMV 
AQAAHIPVHL GAMPRSVEAA LAQFTLAPGD VVILNDPYLG GTHLPDLTTV APVYAGATLI
GYTATRAHHA DVGGMTPGSM PMSREVYQEG LIIPPTLLAR GGIPDEAVIA LICRNSRTPD
ERRGDLAAQM ACHRVGAQRL AELAERHGPA WVARHMEALL AYGERHMRAV IAAIPDGTYT
FEDALDNDGV DADPLTICVR IDIHGECAVV DFEGTSSQCR GPLNAPRAVT ESAVLYCFRC
LGSPDMPSSA GAFAPLDIRV PAGSILAPHS PAAVAGGNVE TAQRVVDVVF GALAQALPDR
IPAASAGTMN NWTFGGMASN GVPFAYYETL GGGMGARPTL PGLSGVQVHM TNTLNTPVEA
LERQFPLIVR RYGLRRGSGG VGRVRGGDGL VREVEFRAPV TVSLLTERRV YAPYGLYGGA
PGLRGRNILL RDGEERILPG KITIDLRPGD ILRIETPGGG GFGAQEQG