Gene Rcas_1551 details


Gene Information

Locus tag: Rcas_1551
Symbol:
ID: 5539027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1990917
End bp: 1992527
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 63%
IMG OID: 640893689
Product: putative alpha-isopropylmalate/homocitrate synthase family transferase
Protein accession: YP_001431662
Protein GI: 156741533
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein
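
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function and variable names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
start_bp, end_bp = 1990917, 1992527

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```

Both results agree with the Gene Length and Protein Length fields. The calculation does not depend on the strand, which the record leaves blank.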


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0164478
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.383881
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGATTC TGCTCTACGA TACGACTCTC CGCGACGGAA CGCAGCGCGA AGGGTTATCG 
CTTTCCGTCG AAGACAAACT GAAGATCGCG CGCGAACTCG ACCTGCTCGG CGTTCACTAC
ATCGAAGGCG GATGGCCCGG CTCGAATCCC AAAGATGCCG AGTTCTTTCA GCGCATCCGC
CGTGCCGACC TGCGCCACGC GAGAGTCGCC GCCTTCGGCA GCACCCGCCG CGCCGACGCC
ACCTGTGATA CCGACGCCAA TATCCAGGCG CTTGTTGCTG CCGAGACGCC GGTGGTAACG
CTCGTCGGCA AGAGTTCGAC CCTGCATGTC GAGCAGGTGC TCGAAACGAC ACGCCAGGAG
AATCTGGCAA TGATCGCCGA AAGCGTCGCA TATTTCAAGG AGCGCGGTAA GGAGGTCGTC
TACGACGCGG AGCACTTCTT CGACGGCTAT AAACTGGATG CCGCGTATGC GCTGGCAACC
CTGACCGCAG CCGCGCACGC CGGCGCCGAC TGCCTGGTGC TGTGCGATAC GAATGGCGGC
AGCCTGCCGC ACGAAGTGAC CGAAATCGTG CAGGCGGTGC AGCGCCGCCT GGCAAACGAA
GGGTTTAGCA ACGGCGCAGC AGGGCGTGGT CCTACCCTCG GCATCCACAC CCACAACGAC
GGCGCGCTGG CAGTCGCCAA CGCGCTTGCC GCCGTGCGCG CAGGATGCGT CCACGTCCAG
GGCACGATCA ACGGCTATGG CGAACGGTGC GGCAATATGG ACCTGATCCC GCTCATCGCT
AATCTGCAAC TGAAACTCGG CTACCGCTGC ATTACCCCCG AACAACTTCG CCGCCTGACC
GAGGTGTCGC ACTATGTCGC CGCCGTCGCC AACCTCAACC CCGACACCCA CGCGCCATTC
GTTGGACACT CCGCATTCGC GCACAAAGGT GGCATCCACG TTGCCGCCGT CGCCAAAGTG
CCGGACAGTT ACCAGCACAT CGACCCGGAA CTGGTCGGCA ATCGGATGCG GGTGGTCGTC
AGTGAACTGT CGGGGCGCGG GAATGTGCGG ATGCGCGCGC AGGAACTGGG GCTTGATCTG
AACGGGAACG AACGGGTGGT ATTGCAACGG ATCAAAGAGC TGGAAAACCG TGGCTTCCAG
TTCGAGGCGG CGGAGGGATC GTTCGAGATG CTGGTGCGCC GCGCTGCGCC CGACTACGAA
CCGCCGTTCG AGTTGCTCGA TTTCACGGTG GTCGTCAGCA AACATGGCGC TGGCGACATA
ATCTCGCAGG CAATGGTCAA GCTCAAAGTC GGCAACGAGG TTATGCACAC CGCAGCCGAA
GGCGACGGAC CGGTCAATGC GCTCGATAAG GCAATCCGCA AGGCGCTGCT GCCACACTAC
CCCGAACTCG CCGATGTGCA ACTGGTCGAC TACAAGGTGC GCATCGTCGA TGAACACCTC
GGCACCGCCG CCAGACCGCG CGTGCTGATC GAATCGGCGC GCGGCGAAGA ACGCTGGAGC
ACGGTCGGCT GCTCGGAAAA TATTATTGAA GCCAGTTTTA TGGCGCTGTG GGACAGCCTG
GAACTCCCGC TGGCGCGACG GCGGGCTTCG TGCAATGCGC ACCGTTCCTG A
 
Protein sequence
MQILLYDTTL RDGTQREGLS LSVEDKLKIA RELDLLGVHY IEGGWPGSNP KDAEFFQRIR 
RADLRHARVA AFGSTRRADA TCDTDANIQA LVAAETPVVT LVGKSSTLHV EQVLETTRQE
NLAMIAESVA YFKERGKEVV YDAEHFFDGY KLDAAYALAT LTAAAHAGAD CLVLCDTNGG
SLPHEVTEIV QAVQRRLANE GFSNGAAGRG PTLGIHTHND GALAVANALA AVRAGCVHVQ
GTINGYGERC GNMDLIPLIA NLQLKLGYRC ITPEQLRRLT EVSHYVAAVA NLNPDTHAPF
VGHSAFAHKG GIHVAAVAKV PDSYQHIDPE LVGNRMRVVV SELSGRGNVR MRAQELGLDL
NGNERVVLQR IKELENRGFQ FEAAEGSFEM LVRRAAPDYE PPFELLDFTV VVSKHGAGDI
ISQAMVKLKV GNEVMHTAAE GDGPVNALDK AIRKALLPHY PELADVQLVD YKVRIVDEHL
GTAARPRVLI ESARGEERWS TVGCSENIIE ASFMALWDSL ELPLARRRAS CNAHRS
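
The protein sequence corresponds to the gene sequence read in frame from the first base. The sketch below illustrates that relationship in plain Python. It rests on two assumptions: that the gene sequence is given 5' to 3' in coding orientation, and that the amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs from it in the permitted start codons). The `translate` helper and the constant names are hypothetical, not part of the source database.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
first_bases = "ATGCAGATTC TGCTCTACGA TACGACTCTC"
last_line = "GAACTCCCGC TGGCGCGACG GCGGGCTTCG TGCAATGCGC ACCGTTCCTG A"

print(translate(first_bases))  # MQILLYDTTL
print(translate(last_line))    # ELPLARRRASCNAHRS
```

The two outputs match the first ten and last sixteen residues of the protein sequence. The final line can be translated on its own because it begins at base 1561, which is a codon boundary.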