Gene Cagg_3350 details

Gene Information

Locus tag: Cagg_3350
Symbol: proA
ID: 7267090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4062396
End bp: 4063664
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 59%
IMG OID: 643568159
Product: gamma-glutamyl phosphate reductase
Protein accession: YP_002464630
Protein GI: 219850197
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0014] Gamma-glutamyl phosphate reductase
TIGRFAM ID: [TIGR00407] gamma-glutamyl phosphate reductase
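
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the function and variable names are illustrative, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
# Values copied from the Gene Information fields above.
START_BP = 4062396
END_BP = 4063664
GENE_LENGTH_BP = 1269
PROTEIN_LENGTH_AA = 422


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the span from Start bp to End bp gives 1269 bp, and 1269 / 3 codons minus the stop codon gives 422 residues, matching the listed Gene Length and Protein Length.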


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000409724
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTGATC TCGAGACGAT TGGCCGGCGA GCGAAGACGG CGGCGCGCAC GCTGGCGAAG 
GTATCAACCG AACAGAAAAA TGCAGCGCTC CACGCAATTG CCGATGGGTT ATTGGCCCGC
CAAGATGAGA TTCTGTCAGC GAACGCGGCC GACGTAGCCG ATGCCGAACG AGCCGGCACA
CCGCCGGCGA TTGTCGATCG CATGCTGTTG ACCGAAGCAC GGCTAGCGAC GATTGCCAAC
GATTGCCGGA AGGTGGCAGA GTTACCTGAC CCGGTAGGTG AGATTTTTGA CCGGCGTGAA
TTGCCGTCGG GTTTGCGCTT GTACAAACGG CGAGTCCCAA TCGGCGTGAT CGGCGCCATT
TACGAGGCAC GGCCCAACGT GACGGTTGAC ATTGCTGCGC TGTGTCTCAA GGCCGGCAAT
GCCGTGATCC TCCGTGGCGG GAGCGATATT GCGCGCAGCG TGGCTGCCAC TACGACCGTC
ATTGCCGAAG CACTAGAACA AGCCGGTCTC CCGATATTTG CCGTACAGAG CATCACCGAT
CCCGACCGCG AATTGGTGCG GCAGTTGCTA CGGCTTGATC GCTACGTGGA TATGATCATC
CCTCGCGGTG GTGCTAGCCT TCATCGGTTT TGCGTCGAGA ACGCGACGGT GCCGGTAATC
GTGGGTGGGA TGGGGGTCAG TCATATTTAT GTCGAGCCGA GCGCCGATTT TGCCCGTGCT
GTGCCGGTCG TGGTCAATGC CAAAGTGCAG CGACCGGGAG CGTGCAATGC GCTCGACACC
TTGCTCGTTC ACCGCGCTGC AGCACCGGTC TTCTTACCTA TGGTCGCAGC AGCGTTAGCT
GAGTACAATG TTGAGCTACG CTGCGATCTA GAAACACTTG CTATCCTTGC CGACGCACCC
GGCCACGAGA ACTGGCAGTT GCGACCGGCA GAGCCGGCCG ATTTTGGCCG TGAGTTTCTG
GCCCTGATCG TGGCGATCAA AGTCGTGGGC GATATTGATG AAGCCCTCGA TCATATTGCC
CAGTACGGCG GCCATTCGGA AGCGATTTTG ACCGGGGATC CGGCCAGCGC GGCTCGCTTT
ACCCGCGAAG TTGACGCAAC TGCGGTGTTC GTGAATGCCA GCACCCGCTT CAACGACGGT
GGTCAATTTG GGTTAGGCGC TGAGGTGGCA ATCTCAACCA ACCGACTGCA CGCCCGTGGG
CCGATGGGTT TGCAAGAATT GACTACATAT ACGTGGATCG GTGAAGGGGA TTATTTGGTA
CGGGCATGA
 
Protein sequence
MIDLETIGRR AKTAARTLAK VSTEQKNAAL HAIADGLLAR QDEILSANAA DVADAERAGT 
PPAIVDRMLL TEARLATIAN DCRKVAELPD PVGEIFDRRE LPSGLRLYKR RVPIGVIGAI
YEARPNVTVD IAALCLKAGN AVILRGGSDI ARSVAATTTV IAEALEQAGL PIFAVQSITD
PDRELVRQLL RLDRYVDMII PRGGASLHRF CVENATVPVI VGGMGVSHIY VEPSADFARA
VPVVVNAKVQ RPGACNALDT LLVHRAAAPV FLPMVAAALA EYNVELRCDL ETLAILADAP
GHENWQLRPA EPADFGREFL ALIVAIKVVG DIDEALDHIA QYGGHSEAIL TGDPASAARF
TREVDATAVF VNASTRFNDG GQFGLGAEVA ISTNRLHARG PMGLQELTTY TWIGEGDYLV
RA