Gene RoseRS_4305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4305 
Symbol 
ID5211289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5408017 
End bp5409258 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content60% 
IMG OID640597891 
Productdipeptidyl aminopeptidase/acylaminoacyl-peptidase-like protein 
Protein accessionYP_001278595 
Protein GI148658390 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0350755 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.767198 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGAGC AGGACGTCAT CCTGAAGCGG CAGAAGTACA ACACGCGGTT CAAGAACGGC 
GATATGGACT TCATGTTCAA CTGGGCGCTG GGTGTGAGCC AGATTGTCGG TATGTCGCCA
TCACAGGTCT TCTACGCCGT CCACGATATC AGAGACGGCG ATCCAGACGG TTGGCGCGAT
GGTTTCTGGC GTCAGGGCGA TTATCAGGTC GAACGGGCGC GAGAGTTTCT CAAACACGGT
CAGCAACTGG CGGCTGGACA GTTGCACCTC GGTGCCGCAT ATGCGTACCG TTCGGCGTTG
CAATACACCC ACCCCAGCGC CAGCGATTTC AATACGCGCG TGCAAACAAT GGAGCGCGCG
TTTCAGCAGG GTGTCCACCT GATCGGCATC CCGATGCGTC CTATCGAGAT TCCGTTCGAG
CACGCCGCAC TGCCGGGTTA TTATCTGGAG CACGATGAGC AGTCGCGCCC GGTTGTGATG
ATGGTCGGCG GCGGGGATAC ATTCCGTGAA GACCTGTTCT ACTTTGCGGG GTACCCTGGC
TGGAAACGCG GCTACAACGT GGTGATGGTC GATCTGCCGG GGCAGGGTGT CACGCCAGAC
CGGGGGCTGC ACTTCCGTGC AGACATGGAA CGACCGATCA GCGCCGTGCT GGACTGGCTC
GAAGCGCACT CCGCCGCTCG TCCCACGCAG ATCGCCATCT ACGGCGTCAG CGGAGGCGGA
TACACGACGG CGCTGGCAGT GTCGTCCGAC CCGCGCATCA GCGCCTGGAT TGCCAGCACT
CCCATTTTCG ATCTGGTCGA AGTGTTCCGA CGCGAGTTCG GCAGCGCGAT GAAAGCGCCC
GGCTGGGTGA TCAACACGTT CATGCGGTTG GCGGGCATGC TGAACAAAAG TGCGGAGATC
AATCTCGACA AGTATGCCTG GCAATTTGGC GCAACCGATT TCAAGAGCGT CGTTGATGGC
GTCGTTGCCC TGGCAAAGCG AGTGGACTAC ACGGGGATCG CCACGCCATC ATTGTTTCTC
ATGAGCGAAG GGGAAGGCGA TGAACTCAAG CGCCAGACGC TCGAAATATA CCATGATCTC
CGTCGACGCG GCGTCGACGT CACTCTCTGC GAATTTACCG CCGCCGAAGG TGCAGACGGT
CACTGCCAGG TGAACAATCT GCGGCTGGCG CACCTGGTCA TCTTCGACTG GCTCGACCGC
GTGTTTGGGC ATACGCCAGG CGATAGGCGA CTGTGGGTGT GA
 
Protein sequence
MQEQDVILKR QKYNTRFKNG DMDFMFNWAL GVSQIVGMSP SQVFYAVHDI RDGDPDGWRD 
GFWRQGDYQV ERAREFLKHG QQLAAGQLHL GAAYAYRSAL QYTHPSASDF NTRVQTMERA
FQQGVHLIGI PMRPIEIPFE HAALPGYYLE HDEQSRPVVM MVGGGDTFRE DLFYFAGYPG
WKRGYNVVMV DLPGQGVTPD RGLHFRADME RPISAVLDWL EAHSAARPTQ IAIYGVSGGG
YTTALAVSSD PRISAWIAST PIFDLVEVFR REFGSAMKAP GWVINTFMRL AGMLNKSAEI
NLDKYAWQFG ATDFKSVVDG VVALAKRVDY TGIATPSLFL MSEGEGDELK RQTLEIYHDL
RRRGVDVTLC EFTAAEGADG HCQVNNLRLA HLVIFDWLDR VFGHTPGDRR LWV