Gene Rcas_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2066 
Symbol 
ID5539546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2652305 
End bp2653483 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content62% 
IMG OID640894201 
Productpeptidase M24 
Protein accessionYP_001432170 
Protein GI156742041 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.804468 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTACT ATCAGAAAGC TGCTCAGGCA GAGGCATTGC TTGCCGAAAC CAATCTCGAT 
GCCTGGCTCA TCTTCGCACG TGAAAGCGTG ATCCGTCCTG ACCCAGGCAT CGAACTCGTT
GTTGGTGTTG ATGTGACGTG GGACTCGGCA TTCATCTTTG GGCGCAATGG GCAGCGCGTC
GCGCTTGTGG GGCGTTACGA CGTCGCCGGT GTGCGCGAGT CGGGGCTGTT TCCGACGATC
ATTGGCTATG ATGAAAGCAT CCGCGATCAT CTGATCGATG TCCTGCGCCG CCTTGATCCG
CTGACTATCG GGTTGAACTA TAGCCTCGAC GACCCGACAG CCGATGGACT GACTCACGGC
ATGTTTCTGC ATCTCTGCGA CCTGTTGGCA GACACGCCCT TTCCGTCACG TTTTGTGAGC
GCCGCGCCCC TGCTGGCGAA GTTGCGTTCG CGTAAGACGC CAGCCGAGGT GGAGCGCATT
CGCGCTGCCG TCGCCGTCAC CGAAGAGATC GTCGATCTGG TTGCACAACA GATTCGCCCT
GGCGTCAGTG AGGCGCAGAT CGCCGCTTTC GTTCATGAGG AGTTTCGCCG CCGCAATCTG
GCGAGCGCCT GGTCATGGGA TGCCTGCCCG ATTGTGAATA GCGGTCCCGA ATCAGAAGTC
GGTCATGGCG GTCCGCGTAA TGACATCGTG GTGCAGCCGG GGCATCTGGT GCACATTGAT
CTGGGCGTTC AGCGCGAGGG GTACTGCTCG GATATTCAGC GCATGTGGTA TGTGCGTCGT
CCTGGTGAAA CGGCGCCCCC GCCGGACGTT CAGCGCGCTT TCGAGACGGT GGTGCGGGCG
ATCGAGGCGG GCGCAGCGGC GCTGCGCCCC GGCGCGCGCG GATACGCGGT GGACGCAGCG
GCGCGTCAGG TCATTGTTGC AGCGGGATAC GACGAGTACC GCCACGCACT GGGGCATGGT
CTGGGGCGCG CCTGTCACGA CGGCGGTCCG TTGCTCGGTC CGCGCTGGCC TCGCTACGGC
AAAACGCCGG AAATGCATGT CGAGGCGGGG AATGTCTACA CCCTTGAACT GGGTGTCGTC
ACTGCGGCAG GGTATATCGG CATCGAGGAA GATGTGCTGG TGACTGACAA AGGCGTAGAG
TTTCTCTCGA CCTTTCAGCG CAGGTTGTGG GAGGTGTGA
 
Protein sequence
MLYYQKAAQA EALLAETNLD AWLIFARESV IRPDPGIELV VGVDVTWDSA FIFGRNGQRV 
ALVGRYDVAG VRESGLFPTI IGYDESIRDH LIDVLRRLDP LTIGLNYSLD DPTADGLTHG
MFLHLCDLLA DTPFPSRFVS AAPLLAKLRS RKTPAEVERI RAAVAVTEEI VDLVAQQIRP
GVSEAQIAAF VHEEFRRRNL ASAWSWDACP IVNSGPESEV GHGGPRNDIV VQPGHLVHID
LGVQREGYCS DIQRMWYVRR PGETAPPPDV QRAFETVVRA IEAGAAALRP GARGYAVDAA
ARQVIVAAGY DEYRHALGHG LGRACHDGGP LLGPRWPRYG KTPEMHVEAG NVYTLELGVV
TAAGYIGIEE DVLVTDKGVE FLSTFQRRLW EV