Gene Rcas_1155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1155 
SymbolglyA 
ID5538621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1494384 
End bp1495694 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content60% 
IMG OID640893287 
Productserine hydroxymethyltransferase 
Protein accessionYP_001431270 
Protein GI156741141 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATGC TTCAGACCCT CTGGCGCAGT GATCCTGCCG TTGCGCGCAT TATCGATGGC 
GAGATGCGCC GTCAGCGCGA CGGATTGGAA CTGATTGCCA GCGAAAACTA TGCCAGTCGC
GCCGTGATGG AAGCACAGGG TTCAGCGCTC ACGAACAAAT ATGCCGAAGG ATATCCGGGC
GCACGCTACT ACGGCGGCTG CGAATGGGTC GATCAGGTGG AAGACCTGGC GCGCGCGCGG
GTCAAAGAGT TGTTCGGCGC AGAATATGCA AATGTGCAGC CGCACTCCGG GTCACAGGCG
AACATGGCCG TCTACTTCAC TTTTCTGCGA CCCGGTGATA AGGTGCTCGG CATGAATCTG
GCGCACGGCG GGCACCTGAC TCATGGCTCC CCGGTTAACT TTTCGGGTCA GTTGTACACC
TTCGTGGCGT ATGGCATCGA TCCCAAGACC GAACGGATCG ATTACGATCA GGTGGCAGAG
ATTGCGCGCC GCGAGCGCCC CAAAATGATC ACGGTCGGCG CCAGCGCCTA TTCGCGTGCC
ATCGATTTTG CCATCTTCCG TCAGATCGCC GATGAAGTCG GCGCGTTTCT CTTCGCCGAT
ATTGCGCACC CTGCCGGGTT GATCGCCAAA GGGTTGCTGC CTAGCCCCAT CCCCTACGCT
CACGTCGTTA CCTCGACCAC CCACAAGACG CTGCGCGGGC CACGCGGCGG CATCATCATG
ATGGGGAAGG ACTTTGAGAA CCCATTCGGG TTGAAGGCAG CGAAGAGCGG TCGCACCCTG
ATGATGTCGG AACTGCTCGA CAAAATGGTC ATCCCCGGTG TGCAGGGCGG TCCCTTGATG
CACGTCATCG CTGCCAAAGC GGTCGGATTC GGCGAAAACC TGCAACCGGA GTTCGAGACG
TATGCCCGTC AGATTATCCG CAATGCGCAG ACACTGGCAG GCGCCCTGAT GGCGCGTGGA
TACCACATCC TCTCCGGCGG CACCGACAAC CACCTGATGC TTATCGACCT GCGCAACAAG
GGAGTGAGCG GCAAGGCGGC GCAGGAGGCG CTCGACCGCG CCGCCATCAC GACCAATAAG
AATGCCGTCC CCAACGACGA CAAATCGCCA TTGATCACCA GTGGCATTCG GCTGGGAACC
CCTGCACTGA CCACCCGCGG CATGAAGGAA CCGGAGATGG AACAGATTGC GGCACTGATC
GACGATGTCA TCACGCATAT CAATGACGAT CATACCATCA ATCGGGTGCG CGAAGAGGTG
TTCGCGCTCT GCGCGCGCTT CCCGGTGCCG GGGCTGGAAC CATCCGCCTG A
 
Protein sequence
MSMLQTLWRS DPAVARIIDG EMRRQRDGLE LIASENYASR AVMEAQGSAL TNKYAEGYPG 
ARYYGGCEWV DQVEDLARAR VKELFGAEYA NVQPHSGSQA NMAVYFTFLR PGDKVLGMNL
AHGGHLTHGS PVNFSGQLYT FVAYGIDPKT ERIDYDQVAE IARRERPKMI TVGASAYSRA
IDFAIFRQIA DEVGAFLFAD IAHPAGLIAK GLLPSPIPYA HVVTSTTHKT LRGPRGGIIM
MGKDFENPFG LKAAKSGRTL MMSELLDKMV IPGVQGGPLM HVIAAKAVGF GENLQPEFET
YARQIIRNAQ TLAGALMARG YHILSGGTDN HLMLIDLRNK GVSGKAAQEA LDRAAITTNK
NAVPNDDKSP LITSGIRLGT PALTTRGMKE PEMEQIAALI DDVITHINDD HTINRVREEV
FALCARFPVP GLEPSA