Gene RoseRS_3082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3082 
Symbol 
ID5210050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3874087 
End bp3875889 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content59% 
IMG OID640596673 
Productpeptidase M61 domain-containing protein 
Protein accessionYP_001277395 
Protein GI148657190 
COG category[R] General function prediction only 
COG ID[COG3975] Predicted protease with the C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.201317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00967911 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAATAA TGTATTTCAT CTCAATGCTG CGCCCCAACA CCCACCTGTA CGATGTGGCG 
CTGGATATCC ATCCCATCGA AGAACCAACC CTCGATCTGG CGCTTCCTGC CTGGACGCCA
GGATCGTACC TGATCCGCGA TTATGCGCGC CATGTACAAT CATTCGCCGT CACGGACGAT
CAGGGCGCGC CATTGCCATG GCGAAAGATC GACAAAACCA CCTGGCGTAT CGAAAACGGC
GCCGCCCGGC GCATCCGGGT CACCTATCAG GTCTATGCGT TCGAGTTGAG CGTCCGCACC
AGCCACCTCG ACAACACGCA TGGCTATTTC AATCCTTCTA ATATCTTCAT GTATCGCTGC
GGTCACGCTC ATGAACCATG CCTGGTCCAT GTCCAGACCC CTCCTGGATG GCGAATAACG
ACCGGTCTCG CACCTGCGCC GGAGCAAAAC GATGGATGGG TCACATTCCA CGCCCACGAT
TACGACGAAC TGGCTGATTC ACCGTTCGAG TGTGGCACGC ACCGCGTGCT GACGTTTGAG
GTGGACGGCA TTTCCCACGA CATTGCGCTC TGGGGACGCG GCAACGAAGA CGAGCAGCGG
ATACTGACCG ATACACGCAC CATTGTTGAA ACCACACGCG CTATGTTCGG TCGCCTGCCA
TACCGTCGCT ACGTCTTCAT CGTCCACCTG GTCGATGGCG GGTATGGCGG TCTGGAGCAT
CGTAACAGCG TATCGAATAT CGTTGATCGC TGGGGATTCC GTCCTGCACG TTCGTACGAA
AAGTTCCTGG CGCTCACTGC CCATGAGTTC TTCCACGTCT GGAATGTCAA GCGCATTCGC
CCCGCACCGC TGGGACCGTT CGACTACACC CGCGAAAACT ATACCCGTCA GTTATGGGTG
ATGGAGGGCA TCACCAGTTA CTACGATCAC CTGATCCTCC TGCGCGCCGG GTTGATCAGC
CGCGAACGCT ATCTCGAAAC GATCGCTGAC GATATCAAAC TGTTGCAGAG TCAACCGGGA
CGCGCGTTGC AGTCGCTCGA ACAGAGCAGT TTCGACGCCT GGATCAAGTT CTACCGCCCC
GATGAGAATG GACCCAACAG CAGCGTCTCG TACTATCTGA AGGGGAGCCT GGTTGCGCTC
CTGCTCGACC TGGAAATCCG ACGGCGCACC GGCGGCGCGC GTTCGCTCGA CGATGTGATG
CGCCACCTCT ATGCGGAATA TGCGGATGAC CACGTGCACG ACCTCTACAG CGGCGATCGG
GTGAAGCGCC CCGGTTTCGA TGACGACGAC GGCTTCTGCC GCGCAGTCGA AACCGTCGCT
GGCGAGGAAG ACGGCGCGTA CCGGACATTC CTGGCGCATG CAGTATCCGG CACAGGCGAG
CTTGATTATG CACGCGCCTT CGAAACAGTT GGATTGCACC TCGTGTGGGG ACATACGCTT
GAAAAAGAGA ATGATCATCT GCCAGCATGG CACGGGTTGC GTCTCAAGAC CGAGCATGGT
CGCCTCAAGG TATCGGTCGT CCTGGCGGGC GGACCCGGCG AAGCTGCCGG GATCTACGCT
GGCGACGAAC TGATTGCTCT CGATGGTGTC CGAATTGACG AAGAGCGCCT CAAGGCGCGG
ATGGCGGAGC GACAACCAGG ACAGACAGTT GTGTTCAGCC TGTTCCGGCG CGACGATCTC
CTCCACGTTC CGCTGCAGCT CGCCGAAGCG CCACCCGACA CCCTCACGAT CGCACCGGTC
GAGGCGCCAA CCGATGAGCA GACCCGCCAG CTCGAAGCAT GGTTGAAGGT GATAGCCTCC
TGA
 
Protein sequence
MTIMYFISML RPNTHLYDVA LDIHPIEEPT LDLALPAWTP GSYLIRDYAR HVQSFAVTDD 
QGAPLPWRKI DKTTWRIENG AARRIRVTYQ VYAFELSVRT SHLDNTHGYF NPSNIFMYRC
GHAHEPCLVH VQTPPGWRIT TGLAPAPEQN DGWVTFHAHD YDELADSPFE CGTHRVLTFE
VDGISHDIAL WGRGNEDEQR ILTDTRTIVE TTRAMFGRLP YRRYVFIVHL VDGGYGGLEH
RNSVSNIVDR WGFRPARSYE KFLALTAHEF FHVWNVKRIR PAPLGPFDYT RENYTRQLWV
MEGITSYYDH LILLRAGLIS RERYLETIAD DIKLLQSQPG RALQSLEQSS FDAWIKFYRP
DENGPNSSVS YYLKGSLVAL LLDLEIRRRT GGARSLDDVM RHLYAEYADD HVHDLYSGDR
VKRPGFDDDD GFCRAVETVA GEEDGAYRTF LAHAVSGTGE LDYARAFETV GLHLVWGHTL
EKENDHLPAW HGLRLKTEHG RLKVSVVLAG GPGEAAGIYA GDELIALDGV RIDEERLKAR
MAERQPGQTV VFSLFRRDDL LHVPLQLAEA PPDTLTIAPV EAPTDEQTRQ LEAWLKVIAS