Gene Rcas_3843 details


Gene Information

Locus tag: Rcas_3843
Symbol:
ID: 5541347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5023605
End bp: 5024924
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 60%
IMG OID: 640895953
Product: aminotransferase class-III
Protein accession: YP_001433898
Protein GI: 156743769
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0001] Glutamate-1-semialdehyde aminotransferase
TIGRFAM ID: [TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase
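
The start and end coordinates, the gene length, and the protein length in the record above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it is the reading that reproduces the listed 1320 bp). The variable names are illustrative only.

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
start_bp = 5023605
end_bp = 5024924

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # complete codons, including the stop codon
protein_length = codons - 1           # the stop codon encodes no residue

print(gene_length, protein_length)    # 1320 439
```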


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0711014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.172621
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCGA CGATGACGAA CGCAACGCTG GTCGAGCGCG CCCGGCAGGT GATTCCCGGC 
GGGGTGAATT CCGGCAATCG CGTTCTTCCC TGGCCCATTG CGTTTGTGCG CGCCGAAGGC
GCCTATCTGT TCGACGCCGA TGACCGCCAG TATCTTGATT ATCACGCCGC CTTCGGACCG
ATCATTCTCG GACACAACCA TCCTCAGGTG AATGCCGCTG TTGCCGAAGC GATGAGCCGC
ATCGACATCA TTGGAGCAGG CGTCACAGAC CTGGAAGTGG AACTTGCCGA CCGCCTCAAC
CGCCATATTC CCTGCGCCGA GCGCGTCCTG CTGACGAACT CCGGCTCTGA AGCGACGTAT
GCCGCGCTCC GTCTGGCGCG CGCCGTCACC GGGCGCAACA AGATCATCAA GTTTCAAGGG
ACCTACCACG GCTGGCACGA TGCCGTCTTG ATGAATGTCA TCAGCCCGCC GGAAAAGATC
GGTCAGCATG ATCCGCTCTC GCTCGGCATG CTTCCCGATG TGATCCGTCA CACGATTGTG
TTGCCGTTCA ACGATACCGA AGCGGTCGCC GACACGCTGC ACCGCCAGGG TGAGGAGATC
GCCGCCGTTC TTGTGGAGGT GATCCCGCAC AATATCGGGT GTGTGTTGCC ACGGCCGGAG
TTTCTTCAGG CGCTGCGCGA CCTGACGCGC CAGCACGGTG TGATGCTGAT CTTCGATGAG
GTCATCACCG GTTTTCGCCA TGCCCTCGGT GGCTATCAGT CGATTGTCGG AGTGACGCCC
GACCTGGCAA CGTTTGCCAA GGCGATGGCG AATGGTTTCC CTATTGCTGC GCTTGCGGGT
CGCGCAGAAC TGATGGATCG TTTTGCGCCC GGCGGCGGGG TGTTCTTTGC CGGAACGTAC
AATGGGCATA GCATCGGCGT AGCGGCGGCG CTGGCGACGA TTGCAGAACT CGAGAGCGGC
GAGGTTCATG CACACTGCTT TGCGCTGGCG CAGATCGCCG CAGACGGGTT GATGCAGATT
GCTGCCGAAC TCGGCATTCC GCTCACGGTC GCGCGCTTCG GGTCGGTGTT CGTTCCCTAC
TTTATGGAAC CCGCTCCGAT CGAGAACTAT ACCGATCTGT TCCGCAACAA TACGGCGCGA
GACCTCTGGT TCCGCAAAAC GATGTGCGAG CACGGTATCT TCATGATCCC GACAGCCCTC
AAACGCAATC ATGTAAGTGC GGCGCACACT CGTGCCGATA TTGATCGCAC GCTGGAGATC
GCTCGCCAGG TGTTGCGTGC GATGCCGGCG GATCTTGAGC GCAGTGCTCG TTATCCATGA
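
The GC content field in the record can in principle be recomputed from the gene sequence. The sketch below shows one way to do that, assuming the sequence is pasted in the 10-base blocks shown above. The function name `gc_content` is hypothetical and not part of any database tooling, and how the database rounds the value is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()   # drop the spaces and line breaks between blocks
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGACAGCGA CGATGACGAA CGCAACGCTG GTCGAGCGCG CCCGGCAGGT GATTCCCGGC"
print(f"{gc_content(first_line):.1%}")
```

Passing the full 1320 bp sequence instead of `first_line` gives the gene-wide value to compare against the GC content field.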
 
Protein sequence
MTATMTNATL VERARQVIPG GVNSGNRVLP WPIAFVRAEG AYLFDADDRQ YLDYHAAFGP 
IILGHNHPQV NAAVAEAMSR IDIIGAGVTD LEVELADRLN RHIPCAERVL LTNSGSEATY
AALRLARAVT GRNKIIKFQG TYHGWHDAVL MNVISPPEKI GQHDPLSLGM LPDVIRHTIV
LPFNDTEAVA DTLHRQGEEI AAVLVEVIPH NIGCVLPRPE FLQALRDLTR QHGVMLIFDE
VITGFRHALG GYQSIVGVTP DLATFAKAMA NGFPIAALAG RAELMDRFAP GGGVFFAGTY
NGHSIGVAAA LATIAELESG EVHAHCFALA QIAADGLMQI AAELGIPLTV ARFGSVFVPY
FMEPAPIENY TDLFRNNTAR DLWFRKTMCE HGIFMIPTAL KRNHVSAAHT RADIDRTLEI
ARQVLRAMPA DLERSARYP
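
The protein sequence corresponds to the gene sequence read codon by codon from the first base. The following is a minimal sketch of that translation, not the database's own method. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ in which start codons are permitted), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()     # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]     # raises KeyError on non-ACGT symbols
        if aa == "*":                      # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGACAGCGA CGATGACGAA CGCAACGCTG"))  # MTATMTNATL
print(translate("CGTTATCCATGA"))                      # RYP
```

The two demo outputs match the first ten and last three residues of the protein sequence listed above.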