Gene Rcas_4447 details


Gene Information

Locus tag: Rcas_4447
Symbol: proA
ID: 5541960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5714901
End bp: 5716172
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 64%
IMG OID: 640896545
Product: gamma-glutamyl phosphate reductase
Protein accession: YP_001434481
Protein GI: 156744352
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0014] Gamma-glutamyl phosphate reductase
TIGRFAM ID: [TIGR00407] gamma-glutamyl phosphate reductase
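
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5714901
end_bp = 5716172

# Assumption: coordinates are inclusive, so the span covers both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1272 bp
print(protein_length, "aa")   # 423 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.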


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0509531
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACC TTGAAGAGAT TGGCGCGCGC GCCCGCGCTG CCGGGAGGCG CCTGGCATTG 
ATGCCGACGG AGCGTAAGAA TGCGGCGCTC GAAGCAATTG CAGCGGCGCT GCTCGACGAA
GCCAATGCGG CTGAGGTGCT GGCTGCCAAT GCTGATGATG TCGCCGCCGG GCGCGATGCC
GGGCTGTCAC CTGCCCTGAT CGACAGGATG ACGCTGACGC CGCAGCGCCT TGCTGCGATT
GCCGCCGATA CCCGCACCGT TGCCGGACTG CCCGATCCGG TGGGTGAGCG TTTCGATGCG
ACCGTGCTGG AGAACGGACT GCGGGTGCAC AAACGTCGCG TGCCGCTCGG CGTTGTCGGC
GTTATTTACG AGGCGCGCCC CAATGTGACG GTCGATGTCG CTGCGCTCTG CCTGAAATCG
GGCAATGCAG CGATTCTGCG CGGCGGTAAG GAGATCACCC GATCCTGCGC GGCGCTGACG
CGCTTGATCC AGAACGCTCT CGCGCAGACC GGGCTTCCCG CCGATGCTAT TCAGGTGATC
GACAACCCGG ACCGCGCGCT GGTCGAGCAG TTGCTGCGCC TTGATCGCTA CGTCGATGTC
ATTATCCCGC GCGGCGGTGC GGGGCTGCAC CGTTTCTGCC GCGAGAAGGC AAGCATCCCG
GTGATTACCG GCGGCATTGG TGTGTGCCAC ATCTACGTCG ATCAGGCGGC TGACCTGGAG
ATGGTCGTTC CTATCGTCCA CAACGCCAAG GTGCAACGTC CGAGCGTCTG CAACGCGCTC
GACACGCTCC TGGTGCATCG CGCGGTCGCA GCCGAGATGT TGCCGGCGGT TGCCCGCGAT
CTTCTCGCCA GCAACGTTGA ACTGCGCGTT GATGAAGAAG CCATGGCGCT CCTGCGCGCC
GCAGGGTTCG ACACTCCGCA GATCGTCCCT GCACAGGAGA GCGATTTCGG CGTAGAGTTC
ATGGCGCTGA TCCTCTCCAT TCGCGTTGTG GCGGGGCTGG ACGAGGCGCT GGAGCATATT
GCGCGCTTCG GCGACCATTC GGACGCGATT ATCACCCGCG ATCCGGCGAC GGCGGAAGCG
TTTGTGCAGG CGGTCGACTC GTCGGCGGTA TTCGTCAATG CTTCGACCCG CTTCAACGAT
GGCGGGCAAC TGGGGTTAGG CGCCGAGATT GCGATCAGCA CCCAGAAACT TCATGCGCGC
GGACCGATGG CGCTGCGTGA ACTGACTTCC TACAAATGGG TGGTGGAAGG TGATGGACAC
GTGCGCGCCT GA
 
Protein sequence
MTNLEEIGAR ARAAGRRLAL MPTERKNAAL EAIAAALLDE ANAAEVLAAN ADDVAAGRDA 
GLSPALIDRM TLTPQRLAAI AADTRTVAGL PDPVGERFDA TVLENGLRVH KRRVPLGVVG
VIYEARPNVT VDVAALCLKS GNAAILRGGK EITRSCAALT RLIQNALAQT GLPADAIQVI
DNPDRALVEQ LLRLDRYVDV IIPRGGAGLH RFCREKASIP VITGGIGVCH IYVDQAADLE
MVVPIVHNAK VQRPSVCNAL DTLLVHRAVA AEMLPAVARD LLASNVELRV DEEAMALLRA
AGFDTPQIVP AQESDFGVEF MALILSIRVV AGLDEALEHI ARFGDHSDAI ITRDPATAEA
FVQAVDSSAV FVNASTRFND GGQLGLGAEI AISTQKLHAR GPMALRELTS YKWVVEGDGH
VRA
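
The sequences in this record are printed in blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first and last rows of the gene sequence are reproduced here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used to group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence as printed in this record.
first_row = "ATGACGAACC TTGAAGAGAT TGGCGCGCGC GCCCGCGCTG CCGGGAGGCG CCTGGCATTG"
last_row = "GTGCGCGCCT GA"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

# The coding sequence opens with an ATG start codon and closes with a TGA stop codon.
starts_with_atg = first.startswith("ATG")
ends_with_tga = last.endswith("TGA")
```

Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the GC content field in the Gene Information table.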