Gene Rcas_3312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3312 
Symbol 
ID5540810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4322582 
End bp4323610 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content59% 
IMG OID640895430 
Productmembrane dipeptidase 
Protein accessionYP_001433381 
Protein GI156743252 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.965789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.649064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACATCAG CACTGCCAGT CTTCGATGGT CATAATGACA CTTTACTGCG TCTCTACACC 
GCAGAACACA ACGACTCATT CTTTTCATCA CCACACGGAC ACCTCGATCT GTCACGCGCC
CGCACAGGCG GTTTTGCCGG CGGTTTCTTC GCAGTCTTTG TTCCGCCGCC AACCCATGCG
TCATCGCCCG ATGATGGCAT CCTGCCCGAT CCGCTGCCCT ACGCCTACGC GCTCCAAACC
ACACTGGGCA TGGCGGCGCT CCTCTTTCGC ATCGAATCCC AATCCGCCGG GCAGGTTTGC
GTTGCGCGCA CGGTCAGCGA AATCGAGTAC GGACTTCAGA CCGATACCCT GAGCGCCATC
CTGCACCTGG AGGGCGCCGA TCCCATCGAT ACTGAGTTTC ATACGCTCGA AGTGTTGTAT
CAGGCGGGTC TCCGTTCGCT TGGCATCGTC TGGAGTCGAC CAAATGCATA TGGATACGGC
GTACCGTTTC GTTTCCCCCA TACCCCCGAT ATTGGTCCCG GTCTGACCGA TGCCGGACGT
GAACTGGTTC GGGCGTGCAA TCGTCTTGGC GTTCTGATCG ACCTTTCTCA CCTGAATGAA
GCCGGTTTCT GGGATGTGGC ACGGTTGAGC GATGCGCCGT TGGTTGCCAC ACACTCGAAT
GCGCACACAA TCTGCCCATC GCCACGCAAC CTGACCGACC GGCAACTCGA TGCAATCCGC
GAGTCGGATG GGATGGTCGG GGTCAATTTC CATGTTGGCT TCCTTCGCCC GGATGGGCAG
CGTGACGCGA ACACGTCGCT GGACGTCGTG GCGAATCATG TCATCTACCT GGTCGAGCGC
CTGGGGCTTG ATCGCGTGGG TTTCGGTTCG GATTTCGACG GCGCGCTGAT GCCATATGAT
CTCGGAGATG CCGCCGGATT GCCACGTCTG CTGGAAACAT TGCGCCGTCG CGGGTTCGAT
GCGCCATCGC TGCGCAAACT GGCGCACGAA AACTGGCTGC GCGTCTTGCG AAAAACATGG
CGATCATGA
 
Protein sequence
MTSALPVFDG HNDTLLRLYT AEHNDSFFSS PHGHLDLSRA RTGGFAGGFF AVFVPPPTHA 
SSPDDGILPD PLPYAYALQT TLGMAALLFR IESQSAGQVC VARTVSEIEY GLQTDTLSAI
LHLEGADPID TEFHTLEVLY QAGLRSLGIV WSRPNAYGYG VPFRFPHTPD IGPGLTDAGR
ELVRACNRLG VLIDLSHLNE AGFWDVARLS DAPLVATHSN AHTICPSPRN LTDRQLDAIR
ESDGMVGVNF HVGFLRPDGQ RDANTSLDVV ANHVIYLVER LGLDRVGFGS DFDGALMPYD
LGDAAGLPRL LETLRRRGFD APSLRKLAHE NWLRVLRKTW RS