Gene Rcas_3323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3323 
Symbol 
ID5540821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4336197 
End bp4337285 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content63% 
IMG OID640895440 
Productmetalloendopeptidase glycoprotease family 
Protein accessionYP_001433391 
Protein GI156743262 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACGA ATTTCACTAT TCTGGCAATC GAAACATCGT GCGACGAGAC GGCAGCAGCG 
GTTATCCGTG GCGGTCGCAT GATCGTCTCG AATGTCGTGG CGTCACAGAT TGAGGAGCAT
CGTCGCTACG GCGGCGTCGT TCCCGAAGTC GCGTCGCGTC AGCATATTTT GACGATCGAT
GCCGTGGTGC GTGATGCGCT GCACCCGCTC CCTGGCGGGT GGAACGACAT CCATGCCGTC
GCAGCGACGT ATGGTCCTGG TCTGGCAGGC GCGTTGATGA CCGGGTTGAA TGTCGCCAAG
GCCATTGCCT GGATGCGCGA ACTGCCCTTC ATTGGGGTCA ACCATATCGA AGCGCATATC
TATGCGAACT GGTTGCTGAC CGATGCGCAG CCCGATGCGC CCGAACCACA GTTCCCCGTC
GTTGCGCTGG TCGTCAGCGG CGGGCATACG CTGCTGGCGC TACTCGAAGG GCACGGCCGC
TACCGCATGC TTGGGCAGAC CCGTGACGAT GCGGCGGGCG AGGCGTTCGA TAAAGTTGCG
CGGTTGCTGG GGCTTGGATT CCCTGGCGGA CCCGCCATTC AGACGGCGGC TGAAAACGCG
CCTGGCGGCG TCACGCTGCC GCGCGCCTGG TTGCGCGACA GTTACGATTT TTCGTTCAGC
GGCTTGAAAA CCGCAGTGCT CCACCAGATT CGCGAGTATC GGGCGCGCGA GGCGGCGCTT
CAGCCCGGCG CCGGCAAACG CGGCGCATCC GCAGCAACCG AACCACCGCC TCTTCCTCCA
GCGATTGTTG CGCGTCTGGC GCGCGCCTTC CAGGAATCGG TCGTGGACGT GCTGGTGACC
AAAACAGTTG AGGCGGCACG CGCCTTCGGC GCAGCCGAAG TGGTACTGGC AGGCGGCGTG
GCGGCCAACC TCCGTCTACG CGAGGAACTC TGTCGCCGCT CGCCTGTTCC GGTGCATATC
CCGCCTGTCG CCCTCTGTAC CGATAATGCC GCCATGATTG GCGCAGCCGC CTTCTACCGC
CTCAATGCTG GCAAACAGGA TGGATGGGAC CTCGATGTGC AGCCGAATCT TCCGTTACAT
GCGGGGTAG
 
Protein sequence
MDTNFTILAI ETSCDETAAA VIRGGRMIVS NVVASQIEEH RRYGGVVPEV ASRQHILTID 
AVVRDALHPL PGGWNDIHAV AATYGPGLAG ALMTGLNVAK AIAWMRELPF IGVNHIEAHI
YANWLLTDAQ PDAPEPQFPV VALVVSGGHT LLALLEGHGR YRMLGQTRDD AAGEAFDKVA
RLLGLGFPGG PAIQTAAENA PGGVTLPRAW LRDSYDFSFS GLKTAVLHQI REYRAREAAL
QPGAGKRGAS AATEPPPLPP AIVARLARAF QESVVDVLVT KTVEAARAFG AAEVVLAGGV
AANLRLREEL CRRSPVPVHI PPVALCTDNA AMIGAAAFYR LNAGKQDGWD LDVQPNLPLH
AG