Gene Franean1_0951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0951 
Symbol 
ID5669365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1114622 
End bp1116142 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content72% 
IMG OID641239879 
Productgamma-aminobutyraldehyde dehydrogenase 
Protein accessionYP_001505313 
Protein GI158312805 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03374] 1-pyrroline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0829268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.201547 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGGA CACCGATGGA ACGGACACCC ATGGAACGGA CACCCATGGA ACGGACGCTG 
AGCAATTTCG TCGGGGGGAA GCAGGCACCG TCGCTGGACG GGCGGACGAT GCCGGTGCTC
AATCCCGCCA CCGGCGAGGT CCTCGCGCAC GCGCCGAGCT CCGGGCCGGC CGACGTGGAC
GCCGCCGTGC GGGCCGCCGG TGAGGCGTTC GAGGGCTGGC GGGACAGCAC GCCGAGCGAG
CGCTCCCGTG CCCTGCTCCG CCTCGCCGAC GCCGTCGAGG AACGGGCGGC CGAGATCGCC
GACGTGGAGT GCGCGAACAC CGGCAAGCCG CGCCAGCTCA CCCTTGACGA GGAGATCGCC
CCGTCGGCCG ACCAGATCCG GTTCTTCGCC GGGGCGGCGC GGCTGCTGGA GGGCCGGTCG
GCGGGGGAGT ACCTGGCGGG CCACACCAGC TACGTCCGGC GTGAGCCGAT CGGTGTCTGC
GCCCAGGTGA CCCCCTGGAA CTACCCGCTG ATGATGGCCG TCTGGAAGAT CGCCCCGGCG
TTGGCGGCCG GGAACACGGT CGTGCTCAAG CCCGCGGAGA CCACGCCCGC CAGCTCGCTG
CTGCTCGCCG AGATCGCGGC CGAGATCCTC CCGCCGGGTG TGCTCAACGT GGTCTGTGGT
GACCGGGACA CCGGCCGCGC CCTCGTCGCC CACCCGGGCC CGGCGATGAT TTCGGTGACG
GGCAGCGTGC GCGCGGGCAT GGAGGTCGCA CGAGGGGCGG CCGACTCGCT CAAGCGGGTA
CACCTCGAGC TCGGAGGCAA GGCCCCGGTG ATCGTCTTTG ATGACGTCGA CCCGGCCGTG
GTCGCCGCGG AGATCGCCGG GGCGGCGTAC TTCAACGCCG GCCAGGACTG CACGGCCGCG
ACCCGGGTGC TCGCCGGGCC CGGCATCGCG GGCGAGCTCG CCGACGCGCT CGCCGAGGCC
GCCCGCGCCA CCACGACCGG CCCGCCCGCC GCCGGCGGCC AGACCGGCGG CCAGACCGGC
GGGGCCGCGG CCGGCGGAGG CGAGGCCGAC TACGGCCCGC TGAACAGCGC CGGTCAGCTC
GGCCGGGTCA GTGGGTTCGT CGAGCGTGCC CCCGAGCACG CCCGCCTGCT CGCGGGCGGG
ACGCCGCTGG ACCGGCCCGG CTACTTCTAC CCGGCCACCG TGATCGCCGG CCTGCGCCAG
GACGACGAGC TGATCCAGCA GGAGGTGTTC GGCCCGGTCG TCACCGTCCA GGAGTTCTCG
GCCGAGGACG AGGCGGTCGC GTGGGCGAAC GGGGTCGAGT ACGGCCTCGC CTCCAGTGTC
TGGACGCGTG ACCACTCCCG CGCGATGCGG GTGGCGCGCC GCCTCGACTT CGGGTGCGTA
TGGGTCAACA CGCACATTTC CATCGTTGCG GAGATGCCAC ATGGAGGATT CAAGAAGAGC
GGTTACGGAA AGGACCTGTC GGTCTACGGG CTGGAGGACT ACACCCGTAT CAAGCACGTG
ATGCACAACA TCGAGTTCTA G
 
Protein sequence
MERTPMERTP MERTPMERTL SNFVGGKQAP SLDGRTMPVL NPATGEVLAH APSSGPADVD 
AAVRAAGEAF EGWRDSTPSE RSRALLRLAD AVEERAAEIA DVECANTGKP RQLTLDEEIA
PSADQIRFFA GAARLLEGRS AGEYLAGHTS YVRREPIGVC AQVTPWNYPL MMAVWKIAPA
LAAGNTVVLK PAETTPASSL LLAEIAAEIL PPGVLNVVCG DRDTGRALVA HPGPAMISVT
GSVRAGMEVA RGAADSLKRV HLELGGKAPV IVFDDVDPAV VAAEIAGAAY FNAGQDCTAA
TRVLAGPGIA GELADALAEA ARATTTGPPA AGGQTGGQTG GAAAGGGEAD YGPLNSAGQL
GRVSGFVERA PEHARLLAGG TPLDRPGYFY PATVIAGLRQ DDELIQQEVF GPVVTVQEFS
AEDEAVAWAN GVEYGLASSV WTRDHSRAMR VARRLDFGCV WVNTHISIVA EMPHGGFKKS
GYGKDLSVYG LEDYTRIKHV MHNIEF