Gene Haur_1241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1241 
Symbol 
ID5733149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1446021 
End bp1447421 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content54% 
IMG OID641278381 
Productpyridoxal-dependent decarboxylase 
Protein accessionYP_001544017 
Protein GI159897770 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0076] Glutamate decarboxylase and related PLP-dependent proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCCAG TTTTAGCCCA CGATGCCGCT CAAATCGACC ACATTTTAGC GCAGACCCTT 
GCCACAGCCA AACAATTTCT CCACGATTTG CCACAGCGTC CGGTTGGAGT TGCGCCGCAA
AGCCACCAAC CAAGCCAATT ACCAAGCGCT GGATTAGGAG CCGAACAAAC GCTTGAGCAT
TTTTTGGCCC GCTACAGCGA TACATTAACT GGTAGCACTG GTCCGCGCTA TTGGGGCTTC
GTCACTGGCG GCGCAACTCC GGCAGCACTG GCAGGCGATT GGCTGGTCAG CACCTTCGAT
CAGAATCCAA GTGGCACAAC CGAAACCGCC GCGATTCGGG TTGAAAACGA AGCCATCAGC
ATGTTGCGCG AACTCTTTGG CTTGCCCACA AGCTTTAGCG GCGCATTCGT TTCTGGGGCA
ACGATGGCCA ATTTTGTCGG CTTGGCGATT GGGCGGCAAT GGGCGGCTCA ACAACTCAAC
CATGATGTTG CCCGCATGGG TTTATATGGG CTTGCGCCAA TTCCAGTTTT GAGCGGAGCA
CCGCACTCAA GCATTTACAA GGCCATGTCA ATGTTGGGCA TGGGTCGTCA ACAGCTGCAA
ACAATCGCCT TACAGCCCGA ACGCGAGGCG GTTGACATCG CAGCATTACG TCAAGCACTT
CAAGCCTTGT CAGCGAACCA ACCTGCGATC GTTGTTGCCA ATGCAGGCAC GGTCAATAGT
GTCGATTTTG ACGATCTTAT GGCAATTGCC GCGCTCAAGC AGGAATTCAA TTTCTGGTTG
CATGTTGATG CGGCTTTTGG TGGTTTTGCC GCCTGTTCGC CGCGCTTTGC TCATTTAGTG
CATGGGCTTG AACAAGCCGA TTCGCTGACG ATTGATGCCC ACAAATGGCT CAATGTGCCG
TATGATTCAG CTATGCAATT CACCCGCCAT AGTGCCTTGC AAGTTGAGGT GTTTCATAAT
AGCGCGGCCT ACCTCAGCCC AATTGGCGAG AATCCAGGCT TTTTTCATCG CACGCCCGAA
AATTCACGCC GTTGGCGAGC ACTGCCAGCA TGGTTCACGC TTATGGCCTA TGGCTCGGCT
GGCTATCAAG AAATGGTTGA GCGCGATTGC GATTTAGCCC AACTGCTGGC TAGCCATATC
AGCGATTCAC CGTTGTTTCG CTTGGTCGCG CCTGTGCGCA TGAACGTCGT TTGTTTCACT
TTGGCGGGCA ATCCTGATAG CACTACAATT CAGGCTTATC TTGATGCAGT ACGGGCTAGC
GGAGCAGTTT TTATGACCGC GACTGTCTAT GCTGGACAAC CAGCGATTCG CGCGGCCTTC
TCAAATTGGC GCACCACCAC CGCCGATGTT GGGCTGGCTT GGCAGGCAAT GGAGCGAGTC
GCCATAGAAC ATAGAGCATA G
 
Protein sequence
MHPVLAHDAA QIDHILAQTL ATAKQFLHDL PQRPVGVAPQ SHQPSQLPSA GLGAEQTLEH 
FLARYSDTLT GSTGPRYWGF VTGGATPAAL AGDWLVSTFD QNPSGTTETA AIRVENEAIS
MLRELFGLPT SFSGAFVSGA TMANFVGLAI GRQWAAQQLN HDVARMGLYG LAPIPVLSGA
PHSSIYKAMS MLGMGRQQLQ TIALQPEREA VDIAALRQAL QALSANQPAI VVANAGTVNS
VDFDDLMAIA ALKQEFNFWL HVDAAFGGFA ACSPRFAHLV HGLEQADSLT IDAHKWLNVP
YDSAMQFTRH SALQVEVFHN SAAYLSPIGE NPGFFHRTPE NSRRWRALPA WFTLMAYGSA
GYQEMVERDC DLAQLLASHI SDSPLFRLVA PVRMNVVCFT LAGNPDSTTI QAYLDAVRAS
GAVFMTATVY AGQPAIRAAF SNWRTTTADV GLAWQAMERV AIEHRA