Gene Rcas_3030 details

Gene Information

Locus tag: Rcas_3030
Symbol:
ID: 5540526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3925498
End bp: 3927432
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 63%
IMG OID: 640895150
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001433103
Protein GI: 156742974
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
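
The coordinate, length, and GC fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; neither assumption is stated in the record itself. The function names `span_length`, `expected_protein_length`, and `gc_fraction` are hypothetical helpers introduced for illustration only.

```python
# Values copied from the Gene Information table.
START_BP = 3925498
END_BP = 3927432
GENE_LENGTH_BP = 1935
PROTEIN_LENGTH_AA = 644


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters; spaces are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Both consistency checks hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Passing the full gene sequence from the Sequence section to `gc_fraction` would give a value to compare against the reported GC content.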


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCACC CGCAGATCGC GCCATACGGC TCGTGGCGCT CACCGATAAC TGCTGCCCTG 
GTCGCAACAT CGGGTGTTTC TCTTAGCACG ATTGCGCTCG ATGGCGACAA CATCTACTGG
CTCGAAGGGC GTCCCGCCGA GGGTGGGCGC GTGGTGGTGG TGCGACGCAC CGCCGATGGC
GCCATTGCCG ATGTGACGCC GCAGGGTTTC AATGTGCGCA CCCGCGTTCA CGAATATGGA
GGAGCGCCGT ACACGGTTGA CCAGGGTATG GTCTATTTCA GCAACTTTGC CGATCAGCGC
CTCTACTGCC AGCGCCCCGG TGCAGCGCCG GAACCGATCA CACCCGAAAC GCCATGGCGC
TATGCCGACT TCGAGGTTGA TCGTCGGCGC AACCGGCTGA TCGGCGTGCG TGAGGACCAC
TCTGGCAGCG GTGAAGCAGT CAATACGATT GTCGCCATCT CGCTCGATGG CGCTGCCGAA
CAGCGCGTGC TGATCAGCGG CGCAGATTTT TATGCGAATC CGCGGCTTAG TCCAGACGGC
CAATGGCTGG CATGGCTTTC CTGGAACCAT CCGAACATGC CATGGGACGC CGCTGAGCTG
TGGGTCGCGC CGGTGCGCGA AGATGGGATG CCGGGTGCTG CCGAACGGAT CGCCGGCGGT
CCTGACGATG CGGCGTTTCA ACCAGCGTGG GGACCGGACG GCGCGCTCTT CTTTGTCGCC
GAGCGCACCG GTTGGTGGAA CCTTTACCGC TGGCACAATG GTGTTGTCCA CGCGCTCTGT
CCGATGGAAG CCGAATTCGG TCTGCCACTC TGGGTCTTCG GCGCACGCAC CTATGCTGTC
GAGTCGGAGG ATCGCCTGGT CTGCACGTAT ATCGAGCGCG GCGAGCACAA AATGGCACTG
CTTGATGTCC GAAGTGGGAA CCTGACGCCG CTCGAACTGC CGTTCAGCGA TTTCGGGTTC
ACCGGTCCGC GCGCCACTGG CGGCAGAGTC GTCTTCGTTG GCGCCTCACC AGCCGCGCCT
GCTGCCCTGG TCATGCTCGA CCTGGCGAGT GGTGCGCTGA CAACCGTTCG CCGCTCGATG
GAGATGCAGA TCGACCCTGG CTTTATCTCG ACGCCGCAGG TGATCGAATT TCCCACCGAA
GGCGGCGTGA CTGCGTTCGG CTTCTATTAC CCGCCGCGCA ACCGTGATTT TCTGGCGCCG
GAAGGCGAAA AGCCGCCGTT GCTCGTCCTG AGCCATGGAG GACCGACCGG CGCAACCTCG
GCGTCATTTG ATCCCGGCAT TCAGTTCTGG ACGAGCCGCG GCATTGCAGT GATGGATGTC
AACTACGGCG GCAGCACCGG ATTCGGGCGC GCCTACCGCC AGCGCCTCGA CGGTCGGTGG
GGCATTGTGG ACGTCGACGA CTGCTGCAAT GCGGCGATGT ACCTGGCAGC GCAGGGGCTG
GCAGACCCGG AACGTCTGAT CATCGCCGGC GGCAGTGCCG GCGGGTACAC CACGCTGGCG
GCGCTCACCT TCCGCCACGT GTTCAAAGTC GGCGCCAGTT TCTACGGCGT CAGCGACCTG
GAGGCGCTGG CGCGCGACAC CCATAAGTTC GAGTCGCGCT ACCTCGACCG GTTGGTAGGA
CCATACCCGG AGCGCGTCGA TATCTACCAC GCGCGCTCGC CGATCTATCA TATCGAGCGG
CTCAACTGCC CGGTGATCTT CCTGCAAGGG CTGGAAGACA AAGTCGTACC GCCGGATCAA
TCCGAGCGGA TGGCGGCGGC GCTGCGCGCG AAGGGCATTC CGGTCGCGTA TCTGGCGTTC
GAGGGCGAGC AACACGGTTT TCGTAAAGCA GAGACCATCA TTCGTGCGCT GGAAGCCGAG
TTATACTTCT ACGCGCGTAT CCTGGGGTTT GAACTCGCCG ATCCGGTCGC GCCGATTGTA
ATCGACAATC TGTGA
 
Protein sequence
MTHPQIAPYG SWRSPITAAL VATSGVSLST IALDGDNIYW LEGRPAEGGR VVVVRRTADG 
AIADVTPQGF NVRTRVHEYG GAPYTVDQGM VYFSNFADQR LYCQRPGAAP EPITPETPWR
YADFEVDRRR NRLIGVREDH SGSGEAVNTI VAISLDGAAE QRVLISGADF YANPRLSPDG
QWLAWLSWNH PNMPWDAAEL WVAPVREDGM PGAAERIAGG PDDAAFQPAW GPDGALFFVA
ERTGWWNLYR WHNGVVHALC PMEAEFGLPL WVFGARTYAV ESEDRLVCTY IERGEHKMAL
LDVRSGNLTP LELPFSDFGF TGPRATGGRV VFVGASPAAP AALVMLDLAS GALTTVRRSM
EMQIDPGFIS TPQVIEFPTE GGVTAFGFYY PPRNRDFLAP EGEKPPLLVL SHGGPTGATS
ASFDPGIQFW TSRGIAVMDV NYGGSTGFGR AYRQRLDGRW GIVDVDDCCN AAMYLAAQGL
ADPERLIIAG GSAGGYTTLA ALTFRHVFKV GASFYGVSDL EALARDTHKF ESRYLDRLVG
PYPERVDIYH ARSPIYHIER LNCPVIFLQG LEDKVVPPDQ SERMAAALRA KGIPVAYLAF
EGEQHGFRKA ETIIRALEAE LYFYARILGF ELADPVAPIV IDNL
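
The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below assumes the usual codon assignments of NCBI translation table 11, which for non-start codons are the same as the standard genetic code. It does not handle the alternative start codons that table permits, which does not matter here because the gene begins with ATG. `CODON_TABLE` and `translate` are illustrative names, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
print(translate("ATGACCCACC CGCAGATCGC GCCATACGGC"))  # MTHPQIAPYG
# Final row of the gene sequence, which ends in the TGA stop codon.
print(translate("ATCGACAATC TGTGA"))  # IDNL
```

Both outputs match the start and the end of the listed protein sequence.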