Gene Hhal_1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1801 
Symbol 
ID4710946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1972853 
End bp1974157 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content71% 
IMG OID639856271 
ProductFolC bifunctional protein 
Protein accessionYP_001003367 
Protein GI121998580 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGGTG CCCGCAGGCC GCAGCCGGCC GCTCCGCCAG ATCTTGAGGG TTGGCTGGCC 
CGACTGGAAG CGGCTCACCC GACGGCCATC GACCTCGGTC TGGAGCGCGT GGCCGCCGTG
GGCCTGCGCC TTGGTGTCCT CGCCCCGGCG GCGCGGGTGG TTACGGTCGG TGGGACCAAC
GGCAAGGGGA GTGTGGCGCG GACCCTGGAG GCGCTGCTGC GCGGTCTCGG GGTGAGCACA
GCCCTCTACA CCTCGCCCCA TTTCCAGCGC TTTAACGAGC GGATGCGCCT CGACGGGGTC
GAGGTGGCCG ATGCCCCATT GGTCAAGGCG CTGGGCCGGG TCGAGGAGGC CCGGGGCGAC
GCTGACGTCA GTCTGACCTA TTTCGAGCAT ACGACCCTGG CCGCCTTCGA TCTCTTTGCG
CGCAGCGGTG CTCGGGTCTG GATCCTGGAG GTGGGGCTGG GCGGGCGGCT GGACGCTGTG
AACGCCATCG ACGCGGATGT GGCGGTGGTC ACCCGGATTG CCCGCGACCA CGCCGAATTC
CTCGGCGACG ACCTGCAGGC GATTGCCGCC GAGAAGGCCG GCATCTTCCG CAAGGGGCGA
CCGGCGGTGA TCGGTCAGCA GGACGCCCCG GCGGCGCTGC GTCAGTCGGC GCTGGAGGTC
GGCGCCGAGG TCGCGCAGGC CGGGGTGGAG TGGACGTTCT CCGCCGAGGC CGACGGGCGC
TGGTGGTGGC GCTGTGGCGA GTACCACTGG GGGGGGCTGC CGGCCTCGGG GATCCCGGGG
TGCGCTGCGC GCGGCAACGT GGCTACCGCC TTGGCCGCCC TCGTTCAGCT GCCGGAGGCG
GTTGGCGTGG ACGCGGCGGC CGTCGCCAGG CTCCTGGATG GGGTGCGGGT GCCCGGGCGT
CTGGAGCGGA TCGAGGCCGG GTCACTGGAG TGGCTGTTGG ATGTCGCGCA TAATGCCGAT
GGTGCGGAGG AGCTCGCCCG GGTGCTGGAT GACCGAACGG TCACCGGGCA AACGCGAGCC
CTCTTCGCCG TGGCGGCGCG CAAGGATGCC CGGGCATTGG TCGCGGCGTT GAGTGGTCGG
ATCGACGCCT GGTACCTGCC GCAGCTGGAT GAGCCGGATA TGCGCAACGC CGATGAACTG
GCCAGCCAGC TTGCCGACGC GGGTGAGTCG GTGGTGCACT GCGGTGACGG GGTGGCGACC
ACCCTGGCCG CTCTCCAGGG CCAGGCCGGG CCCGGCGATC GGGTTGTGGT CTTTGGCTCC
TTCCGCACTG TGGCCGCCGT ACAAGCGCAG CAAGGCTGGG GGTAG
 
Protein sequence
MNGARRPQPA APPDLEGWLA RLEAAHPTAI DLGLERVAAV GLRLGVLAPA ARVVTVGGTN 
GKGSVARTLE ALLRGLGVST ALYTSPHFQR FNERMRLDGV EVADAPLVKA LGRVEEARGD
ADVSLTYFEH TTLAAFDLFA RSGARVWILE VGLGGRLDAV NAIDADVAVV TRIARDHAEF
LGDDLQAIAA EKAGIFRKGR PAVIGQQDAP AALRQSALEV GAEVAQAGVE WTFSAEADGR
WWWRCGEYHW GGLPASGIPG CAARGNVATA LAALVQLPEA VGVDAAAVAR LLDGVRVPGR
LERIEAGSLE WLLDVAHNAD GAEELARVLD DRTVTGQTRA LFAVAARKDA RALVAALSGR
IDAWYLPQLD EPDMRNADEL ASQLADAGES VVHCGDGVAT TLAALQGQAG PGDRVVVFGS
FRTVAAVQAQ QGWG