Gene Rcas_1209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1209 
Symbol 
ID5538675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1562359 
End bp1563546 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content64% 
IMG OID640893341 
Productaminotransferase class I and II 
Protein accessionYP_001431324 
Protein GI156741195 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGCA GGCTTGCCCG CCGTGTGGCA GGGTTTGGCA CGACGATCTT CACCGAGATG 
AGCGCGCTGG CGCTGGAGCG CGGCGCGATC AATCTGGGGC AGGGATTCCC CGATTTTCCG
GGTCCGTCAT TTGTGAAGGA AGCGGCAACC GCAGCCATTG GCGCCGACAT CAACCAGTAT
GCGCCGATGC CCGGTCTGCC CCGCCTGCGG CAGGCGGTTG CCGCGCAGTG GGAATGCGAC
TATGGCCGCG CAGTGGACTG GCAGCGCGAA GTGACGATCA CCAGCGGCGC GACGGAGGCG
CTCTGCGATG CGATGCTGGC GCTCCTTGAT CCGGGGGATG GAGTCGTCAT CTTTGAGCCG
GCGTATGACG CTTATGTGCC CGATATTACG CTGGCAGGCG GCACGCCGCT GCCGGTGCGC
CTGTATCCGC CCGTCGCCGA TCATACGGCG TGGTGGTTCG ATCCGGTGGA ACTGCACGCC
GCCTTCGCGC GCAAACCGAC GCTCATCATT CTCAACACGC CGCACAACCC GACCGGCAAA
GTCTTTACCC GCACCGAATT GGAACTCATC GCCCATCTTT GTCAGGAGTA CAACACGATC
GCCATTACCG ACGAAGTGTA CGACCGGTTG GTGTTCGACG GCGCGGCGCA TATTCCGCTG
GCGACGCTCC CCGGCATGTG GGAGCGCACC TTGACGCTCA ACAGCGCTGG AAAGACCTTT
AGCGTCACCG GCTGGAAGAT CGGCTACGCG GTCGGACCGG CGCATCTGAA CCATGCACTG
CGTCAGGCGC ATCAGTGGGT GACGTTCGCC ACAGCATCGC CGTTGCAGGA AGCCATCGCC
ACAGCGCTGG AACAGGCGTC GGTCAACGGC TACTACCGCG ACTTGCTGCG CGACTACGGC
GAACGCCGGG CGCGACTGGA ACAGGCGCTC GAAACTGCCG GATTGCCGGT GCTGCCGGTG
GAGGGCGCCT ATTTCATTTC TGCCGACATC AGCGCTACTG GCTGGACCGA CGACCGTGCC
TTCTGTCGCT GGTTGACGAC CGACATTGGC GTTGCGGCGA TCCCGACGTC GGTGTTCTAC
AGCGATCCGG CGAGTGCGCC GTGTTTGGCG CGTTTCTGTT TCGCCAAGCG GCTGGAGACC
ATCGATGCCG CAGCAGAGCG CCTGGCGGCG CTGCGCACTC GTTGGTGA
 
Protein sequence
MSGRLARRVA GFGTTIFTEM SALALERGAI NLGQGFPDFP GPSFVKEAAT AAIGADINQY 
APMPGLPRLR QAVAAQWECD YGRAVDWQRE VTITSGATEA LCDAMLALLD PGDGVVIFEP
AYDAYVPDIT LAGGTPLPVR LYPPVADHTA WWFDPVELHA AFARKPTLII LNTPHNPTGK
VFTRTELELI AHLCQEYNTI AITDEVYDRL VFDGAAHIPL ATLPGMWERT LTLNSAGKTF
SVTGWKIGYA VGPAHLNHAL RQAHQWVTFA TASPLQEAIA TALEQASVNG YYRDLLRDYG
ERRARLEQAL ETAGLPVLPV EGAYFISADI SATGWTDDRA FCRWLTTDIG VAAIPTSVFY
SDPASAPCLA RFCFAKRLET IDAAAERLAA LRTRW