Gene Rcas_4452 details


Gene Information

Locus tag: Rcas_4452
Symbol:
ID: 5541965
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5721435
End bp: 5722556
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 63%
IMG OID: 640896550
Product: peptidase M24
Protein accession: YP_001434486
Protein GI: 156744357
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
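
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (this page does not state the coordinate convention; the function names are illustrative only).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
length = gene_length(5721435, 5722556)
print(length, protein_length(length))  # 1122 373
```

Under these assumptions the computed values agree with the listed Gene Length (1122 bp) and Protein Length (373 aa).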


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0539466
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000869693
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCACAGG TGACACTTAC GTCTCCACGA ATCCCACGCA TTCAGGCGGC TCTGCGCCGT 
CATGGTTTCG ACGCGCTGGC TGTAGTTCCA GGAAGCAACC TGCGCTATCT TGCCGGTCTG
ACATTCCACG CCGGTCTGCG ACTGACCGTA ATGGTGACGC CGGTTGAGGG GCAACCAGCG
CTGGTGGTTC CCGGGTTGGA GTATGGGCGC GTGGCTGAGA CCACGGGCGC TGTGTTTCGA
TCCTATCCGT GGGGCGATGA TGAGGGACCG GGAAATGCCC TGATGCGCGC GGTGCGTGAT
ACCGGTCTGG GGCAGGGAAG CGTCGTCGGC ATCGAGCATA CCGTTATGCG CGTGTTTGAA
CTGCGTGCGC TGGAACAGGC GCTTCCCGGC GCACAGTTCG TTGATGCCAC GCCTCTCCTG
GCAGAACTGC GGATGGTTAA GGATGCGGCG GAACTGGAAG CGATGCGTGT TGCGGTGCAG
GTCATCGAAG CGACGTTGCA CCAGACGTTA ACACAGGTGC GGGCAGGCAT GCGCGAACGC
GACATCGCCG ATCTGTGGGA ACGCGCCATT CGCGCGGCTG GATGCCAGCC CGCCTTTGAG
ACGACGGTCG CCAGCGGACC GAACAGCGCC AACCCGCACC ATACCAGCGG TGATCGGGCG
TTGCAGGATG GCGACATGGT CGTGTTCGAC GGAGGCGCTA TGTATCAGGG ATATGTATCG
GACATTACCC GCACATGTGT AGTCGGGCAT CCATCGGACG AGATGCGTCG CGTGTACGAT
CTGGTGCTGG CGGCAAATGC GGCCGGACGG GACGCGGCGG CGCAACCCGG CGCGACCGGC
GCGTCGATCG ATGCCGCAGC GCGCCAGGTC ATTGAACGCG GCGGGTACGG ACCGTTCTTC
ATCCATCGCA CCGGGCACGG CATCGGTCTC GATGTGCATG AGCCGCCGTT CATCGTTGCC
GGAAGCCAGG CGCCGCTGCC GATTGGTGCG ACGTTTACCG TCGAGCCTGG CATCTACCTG
CGTGGCATAG GTGGTGTGCG CATCGAAGAT GACGTGGTCA TCACGGCTGA TGGCGCCGAG
TCGCTGACGA CATTCCCGCG TGAGATTCAC TCTATATCGT AA
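
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any length or composition check. The sketch below shows one way to do that; the helper names are hypothetical, and only the first line of the sequence is used as sample input to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here as sample input.
first_line = "ATGGCACAGG TGACACTTAC GTCTCCACGA ATCCCACGCA TTCAGGCGGC TCTGCGCCGT"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Applied to the full sequence, the same helpers could be used to compare the cleaned length and GC fraction against the Gene Length and GC content fields.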
 
Protein sequence
MAQVTLTSPR IPRIQAALRR HGFDALAVVP GSNLRYLAGL TFHAGLRLTV MVTPVEGQPA 
LVVPGLEYGR VAETTGAVFR SYPWGDDEGP GNALMRAVRD TGLGQGSVVG IEHTVMRVFE
LRALEQALPG AQFVDATPLL AELRMVKDAA ELEAMRVAVQ VIEATLHQTL TQVRAGMRER
DIADLWERAI RAAGCQPAFE TTVASGPNSA NPHHTSGDRA LQDGDMVVFD GGAMYQGYVS
DITRTCVVGH PSDEMRRVYD LVLAANAAGR DAAAQPGATG ASIDAAARQV IERGGYGPFF
IHRTGHGIGL DVHEPPFIVA GSQAPLPIGA TFTVEPGIYL RGIGGVRIED DVVITADGAE
SLTTFPREIH SIS