Gene Rcas_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2033 
Symbol 
ID5539511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2606015 
End bp2607319 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content62% 
IMG OID640894168 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_001432139 
Protein GI156742010 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCT ACCGTTCTGA GTCACTGTTC GCAGAAGCGC GTTCGCTCTT CCCCGGCGGC 
GTCAACAGTC CGGTGCGCGC CTTTCGCGCC GTCGGCGGCG CGCCGCGCTT TATTGCGCGC
GGCGAGGGGG CATTTCTCGT TGATGTCGAT GGCAATCGCT ACATCGATTA CGTTCTGTCG
TGGGGACCAC TGATCCTGGG GCATGCGCAC CCCAATGTTG TTGCCGCCAT CGCCGAACAG
GCGGCGCATG GCACGTCGTT CGGCGCCCCG ACCGAACTCG AAAGCGAACT GGCACGTCTG
ATCACACAGG CGATGCCCTC GGTTGAAATG GTGCGCTTCG TCTCGTCGGG CACCGAAGCA
GCAATGAGCG CCCTGCGCCT CGCGCGCGCC GCAACCCGCC GCGACAAGGT CATCAAGTTT
GCCGGCTGTT ACCATGGGCA CTTCGACGGA TTTCTGGTGC AGGCTGGCTC CGGTGTAGCA
ACGCTTGGCT TGCCGGACAG TCCGGGGGTG ACGGCGGCAA CGGCTGCAAG TACGTTGACG
GCGCCGTATA ACGATCTTGA TGCGGTAGAG TCGCTGTTGA AGGCGAATCC CGGCGAAGTG
GCGGCGATTG CCGTCGAACC GGTTGCCGGA AACATGGGAC TGGTGCTGCC ACAACCCGGT
TTTCTCGAAG GTTTGCGCCG CTTAGCCGAC GAACATGGCG CACTGCTGAT CTTCGACGAG
GTTATGACCG GCTTTCGAGT AGGGTATGGC GGCGCACAAG GAAAGTATGG CATCACCCCT
GATCTTACCT GTCTCGGCAA GGTGATTGGC GGTGGTTTAC CGGCTGCCGC CTATGGCGGA
CGGCGCGATC TGATGGAACT GATCGCGCCC GCCGGTCCGG TGTATCAGGC AGGCACCCTT
TCCGGCAATC CGCTGGCAAT GGCGGCTGGC GCGGCGACCC TGCGGGCTAT CAGGGCGCCT
GGCGTCTTTG AGCAATTGGA ACGGGCAGCG GCGATGCTCT GTTCTGGTTT TGAGCACGCT
GCCGCCGAAG CGGACATCGC GCTGCGTACT GCTTATGCCG GCAGCATGTG GGGTTTCTTC
TTCACCGATG AACCGGTGGT CGATTATGTC TCGGCGAAGA AATCAGATAC GCAACGCTAC
GCGCAGTTCT TCCACGCGAT GCTGGAACGC GGCATCTACC TGGCGCCAGC CCAATTCGAG
GCATCTTTCG TATCGCTCGC GCATAGCGAT GCGCTCATTC AAGAGACGAT TGCCGCCGCC
GCCGACGCGC TACGATCGAT CCAGAACGCT GCTCGGAAAG GCTGA
 
Protein sequence
MKTYRSESLF AEARSLFPGG VNSPVRAFRA VGGAPRFIAR GEGAFLVDVD GNRYIDYVLS 
WGPLILGHAH PNVVAAIAEQ AAHGTSFGAP TELESELARL ITQAMPSVEM VRFVSSGTEA
AMSALRLARA ATRRDKVIKF AGCYHGHFDG FLVQAGSGVA TLGLPDSPGV TAATAASTLT
APYNDLDAVE SLLKANPGEV AAIAVEPVAG NMGLVLPQPG FLEGLRRLAD EHGALLIFDE
VMTGFRVGYG GAQGKYGITP DLTCLGKVIG GGLPAAAYGG RRDLMELIAP AGPVYQAGTL
SGNPLAMAAG AATLRAIRAP GVFEQLERAA AMLCSGFEHA AAEADIALRT AYAGSMWGFF
FTDEPVVDYV SAKKSDTQRY AQFFHAMLER GIYLAPAQFE ASFVSLAHSD ALIQETIAAA
ADALRSIQNA ARKG