Gene Rcas_1696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1696 
Symbol 
ID5539174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2189309 
End bp2190679 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content62% 
IMG OID640893835 
Productpeptidase dimerisation domain-containing protein 
Protein accessionYP_001431806 
Protein GI156741677 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.151235 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00152068 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCTGGC AGACATATCT GCGCGAACAG CAAGACCGGT TCCTTGCCGA GTTGCTCGAT 
TTCCTGCACA TTCCGAGTGT GTCGGCGCTG CCAGAGCACG CGGGTGATGT GCAGCGTGCG
GCGGAGTGGG TGGCGGAACG GATGCGCACC GCCGGGATCG AGTCGGTGCA GATTTTGCCA
ACCGGCGGGC ATCCCGTCGT GTATGGCGAT TGGTTGCACG CGCCTGGAAA GCCAACGGTG
CTGATCTATG GGCATTTCGA TACGCAGCCT GCCGACCCGC TGGAGTTGTG GGAACATCCA
CCGTTCGAGC CGGTGGTGCG TGATGGACGG GTGTATGCGC GTGGCGCATC GGACGACAAG
GGAAACATGC TGCCGCCGAT CCTGGCAGTT GAGGCATTGT TGCGCACAAC GGGCGCGCTG
CCGGTGAATG TCCGCTTTCT CTTCGAGGGC CAGGAAGAGA TCGGCAGCCC GCAGATTCCG
GCATTTGTGA AGGCTCACCG CGAGATGCTG GCGTGCGACC TGGTGGTGAG CAGCGATGGC
GGGCAGTGGA GCGAGACCGA GCCGGTCATC CTCACCGGTC TGCGCGGCGG ATGTGGCGTA
CAGATCGATG TGCGCGGTCC GAACCGGGAC CTCCATTCCG GCATTTACGG TGGCGCAGTG
CAGAACCCGA TCCATGCGCT GGCGTCCATT CTTGCGTCGA TGCGCGGCGC CGACGGGCGC
ATTCTGGTCG AAGGGTTCTA CGATGCGGTG CAACCGCTGA CCGATGACGA GCGTCGGCGG
TTCGCGTCGG CCCCCTTCGA TGAAGCCGCA TATATGGCGG ACCTGGGAGT GACGGCATTG
TGGGGAGAAG CCGGATACAC TGTCTACGAA CGCACCTGGG CGCGCCCCAC GCTGGAGATC
AATGGCGTCT GGGGCGGATT CCAGGGCGAG GGAGTGAAGA CGGTCCTGCC GGCTGAGGCG
CATGCCAAGA TTACCTGCCG ACTGGTCGCC AATCAGGACC CGGCGACGAT TGTGGAGTTG
ATCAAGGCGC ATGTGCAACA GCACACACCG CCAGGAGTCA CGGTAGCGGT CACGCCGCTG
AAATTTCTGG CGAAGCCGTA CCTGATGCCA TTCGATCATC CGGGAAACCG TGCCGCGCGC
GATGTGCTGG TTCACATGTA CGGACGCGAA CCGTATGAAG TGCGTAGCGG CGGCAGTATT
CCGATCTGCA CTATCTTGCT GGACGAATTA GGGGTATACA CGGTCAATTT CGCCTTCGCG
TTAGAAGATG AGCGTCAGCA CTCGCCGAAC GAATTCTTCC GCCTGAGCAG TTTCCGTCGT
GGGCAGGAGG GGTACTGCCT GTTGTTGGAG CGATTGGCGG ATGTGGGGTA G
 
Protein sequence
MTWQTYLREQ QDRFLAELLD FLHIPSVSAL PEHAGDVQRA AEWVAERMRT AGIESVQILP 
TGGHPVVYGD WLHAPGKPTV LIYGHFDTQP ADPLELWEHP PFEPVVRDGR VYARGASDDK
GNMLPPILAV EALLRTTGAL PVNVRFLFEG QEEIGSPQIP AFVKAHREML ACDLVVSSDG
GQWSETEPVI LTGLRGGCGV QIDVRGPNRD LHSGIYGGAV QNPIHALASI LASMRGADGR
ILVEGFYDAV QPLTDDERRR FASAPFDEAA YMADLGVTAL WGEAGYTVYE RTWARPTLEI
NGVWGGFQGE GVKTVLPAEA HAKITCRLVA NQDPATIVEL IKAHVQQHTP PGVTVAVTPL
KFLAKPYLMP FDHPGNRAAR DVLVHMYGRE PYEVRSGGSI PICTILLDEL GVYTVNFAFA
LEDERQHSPN EFFRLSSFRR GQEGYCLLLE RLADVG