Gene Franean1_3843 details


Gene Information

Locus tag: Franean1_3843
Symbol: gabD1
ID: 5672206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4565508
End bp: 4566884
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 72%
IMG OID: 641242721
Product: succinic semialdehyde dehydrogenase
Protein accession: YP_001508141
Protein GI: 158315633
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
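
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 4565508
end_bp = 4566884
gene_length_bp = 1377
protein_length_aa = 458

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span // 3
coding_codons = codons - 1

coords_consistent = (span == gene_length_bp) and (span % 3 == 0)
protein_consistent = (coding_codons == protein_length_aa)

print(span, coding_codons, coords_consistent, protein_consistent)
# prints: 1377 458 True True
```

Under these assumptions the reported gene length (1377 bp) and protein length (458 aa) agree with the coordinates.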


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.799079
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATCC AGTCGGTCAA TCCGGCGACG GGCGCAGTGG TCAGGACGTT CGACGCGCTC 
GACGGTGACG AGATCGAACA GCGGCTGGCC CTGGCCAGCG CGACGGCCGC CGTCTACCGC
CGGACGACCT TCGCTGAACG GGCCGCGCTG CTGCGGCGGG CCGCCGACAT TCTCGACGCG
GAACGGCACG AGATCGCCGT GACGATGACG ACCGAGATGG GCAAGACGCT GCGCTCGGCC
GAGGCCGAGG CGGCGAAGTG CGCGAAGGGC ATGCGGTTCT ATGCCGAGCA CGCGGAGGCG
TTCCTCGCCG ACGAGACGCT GGCCGACCCG GGCTCGGTCG GCGCGAGCCG GGCGTTCGGC
CGGTACCAGC CGCTCGGCGT CGTCCTCGCG GTGATGCCGT GGAACTTCCC GCTCTGGCAG
GTCGTCCGGT TCGCGGCGCC GGCGCTGATG GCGGGCAACG TCGGGCTGCT CAAGCATGCC
TCGAACGTCC CGCAGTGCGC GCTCTACCTG GAGGACCTGT TCCGCCGGGC CGGGTTCCCC
GAAGGCGCCT TCCAGACCCT GCTGATCGGC GCCGGGCAGG TCGAGGCGGT GCTGCGCGAC
CCCCGCGTCG CCGCGGCGAC CGTCACGGGC AGCGAACCGG CCGGCCAGGC CGTCGCGTCC
GTGTGCGGGC AGGAGATAAA GAAGACCGTC CTCGAGCTCG GCGGCAGCGA CCCGTTCGTC
GTCATGCCCT CCGCCGACGT CGCGCGCGCC GCCGAGGTGG CGGTCACCGC CCGCTGCCAG
AACAACGGGC AGTCCTGCAT CGCCGCGAAG CGCTTCATCG TCCACGAGGA CGTCTACGAG
CAGTTCGCCG AGCTGTTCGC CGCCGGGATG GCGGCCCTGA AGGTCGGTGA CCCGATGGAC
CCGAGCACCG ACGTCGGCCC GCTGGCGACC GAGGGCGGCC GGCTCGACAT CGAGGAGCTC
CTCGCGGACG CGGTGAAGGA GGGCGCCAGC ATCCTGTGCG GCGGGACGGC GCCGTCCGGG
TCGGGGTACT TCTTCCCACC GACCGTCGTC GGTGACGTGA CTCCGGCGAT GCGCCTGCAC
CTCGAGGAGG CGTTCGGCCC GCTGGCCACC CTCTACCGGG TGCCGGACAT CGACGCGGCG
ATCGAGCTGG CGAACGTGAC GTCGTTCGGG CTCGGCTCCA ACGCGTGGAC GACCGACCCG
GCCGAGCAGG AGCGGTTCAT CTCCGACCTG GTCGCCGGCG CGGTCTTCCT CAACGGCATG
GTCAGCTCCC ATCCGGAGCT GCCGTTCGGC GGCGTCCGCC GCTCCGGCTA CGGCCGTGAG
CTGAGCGCGG TCGGCATCCG GGAGTTCTGC AACCTCAAGA CCGTCTGGGC CGGCTGA
 
Protein sequence
MAIQSVNPAT GAVVRTFDAL DGDEIEQRLA LASATAAVYR RTTFAERAAL LRRAADILDA 
ERHEIAVTMT TEMGKTLRSA EAEAAKCAKG MRFYAEHAEA FLADETLADP GSVGASRAFG
RYQPLGVVLA VMPWNFPLWQ VVRFAAPALM AGNVGLLKHA SNVPQCALYL EDLFRRAGFP
EGAFQTLLIG AGQVEAVLRD PRVAAATVTG SEPAGQAVAS VCGQEIKKTV LELGGSDPFV
VMPSADVARA AEVAVTARCQ NNGQSCIAAK RFIVHEDVYE QFAELFAAGM AALKVGDPMD
PSTDVGPLAT EGGRLDIEEL LADAVKEGAS ILCGGTAPSG SGYFFPPTVV GDVTPAMRLH
LEEAFGPLAT LYRVPDIDAA IELANVTSFG LGSNAWTTDP AEQERFISDL VAGAVFLNGM
VSSHPELPFG GVRRSGYGRE LSAVGIREFC NLKTVWAG
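
The gene and protein sequences above are related through translation table 11. As a minimal sketch of how that relationship could be checked, the snippet below translates the first 60 bases of the gene sequence and compares the result with the first 20 residues of the protein sequence. The helper names (`clean`, `translate`) are hypothetical. The codon table is built from the standard compact amino-acid string in T, C, A, G base order; table 11 uses the same amino-acid assignments as the standard code and differs only in which codons may act as starts.

```python
# Amino acids for all 64 codons in T, C, A, G order (standard assignments,
# which table 11 shares; only the permitted start codons differ).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and the first two blocks of the protein sequence.
gene_start = "ATGGCGATCC AGTCGGTCAA TCCGGCGACG GGCGCAGTGG TCAGGACGTT CGACGCGCTC"
protein_start = clean("MAIQSVNPAT GAVVRTFDAL")

print(translate(gene_start))                   # MAIQSVNPATGAVVRTFDAL
print(translate(gene_start) == protein_start)  # True
```

Passing the full gene sequence to `translate` in the same way would allow the complete 458-residue protein to be compared.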