Gene Franean1_3122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3122 
Symbol 
ID5671500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3677186 
End bp3678751 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content65% 
IMG OID641242019 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001507439 
Protein GI158314931 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.680526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGCG GGCCGCTGAG TCAAATGGCG CAGCCCACGG TATTATTCGA AAGGAACACC 
TCATTGTCCG ACGTCACCAT CAGACACTGG GTTGATGGCC AGCCGTTCAA CGGCACCGGC
ACACGATGGG CCGAGGTCAC CAATCCGGCC ACGGGCCACG TGACCGGCCG AGTGGCACTC
GCGTCGAGCA AGGACGCCGA TCACGTCATC GCAGTGGCGG CGCGAGCCGC CGGGTCGTGG
AGTCGTACCT CCCTGGCCCA GCGCACACGG ATCCTGTTCG CCTTCCGTGA GCTGCTCGAC
TCCCGCCGGG ACGAACTCGC CGCCATCATC ACGGCGGAGC ATGGCAAGCT GCCCTCGGAC
GCCCTCGGGG AGATCGCCCG TGGTCAGGAG GTCGTCGAGT TCGCCTGCGG CGTGTCGCAC
CTACTCAAGG GCGGGCACTC CGAGTCCGTC TCGACCGGTG TCGACGTCCA TTCCAAGCGT
GACCCCCTGG GCGTCGTCGG CATCATCTCG CCGTTCAACT TCCCCGCCAT GGTGCCGATG
TGGTTCTTCC CACTCGCCAT CGCCACCGGC AACACAGTCG TCCTGAAGCC AAGTGAGAAG
GACCCGACGG CCGCGCTCTG GATCGCCGAC CTCTGGAAGC AAGCCGGATT ACCGGACGGG
ATCTTCAACG TGCTGCAGGG GGACAAGGAG GCTGTCGACG CGCTCATCGA GAGCCCGGTC
GTGCAGTCCA TTAGCTTCGT TGGGTCCACC CCCGTCGCCC AGTACGTCTA CGAGGCATCA
TCTAGGCACG GCAAGCGCGT GCAGGCGCTC GGCGGTGCTA AGAACCACAT GATCGTCCTT
CCTGACGCAG ATCTCGACTT GGCGGCCGAC GCCGCGGTGA ACGCGGGTTA CGGCAGCGCC
GGCCAGCGCT GCATGGCCGT TAGCGTCCTC GTGGCCGTGG GCGAGATCGC CGACGACCTC
GTGGCCAGGA TCGCCGATCG GACGAGGACA CTGGTCGTCG GCGACGGCGC CGAAGCCGAC
ATGGGGCCGC TGATCACCCG CGCTCACCGT GACCGGGTCG CCTCCTTCGT CGACGCCGGC
GAGCAGGACG GGGCAGCTAT CGAGGTCGAC GGCAGGGATG TGCAAAAGGG CGGCATCCAG
GATGGGTTCT GGCTCGGTCC TACGCTGTTG GATCACGTCA CACCTGCCAT GAAAGTCTAC
CAGGAGGAGA TCTTCGGCCC CGTCCTCTGC GTCGTCCGAG TAAACACGTA CGACGAGGCT
GTTGTGCTGG TCAATGGGAA TCCCTACGGC AACGGGGCAG CATTGTTCAC CAACGACGGC
GGTGCCGCGC GGCGCTTCGA GGCAGACGTG CAGGTCGGGA TGATCGGAGT GAACATCCCT
GTCCCGGTTC CCGTCGCCTA CTACTCCTTC GGAGGGTGGA AACAGTCGCT GATGGGAGAC
ACCCACGCAC ACGGAACAGA GGGGGTCCAG TTCTTCACCC GCGGCAAGGT GGTGACCACG
CGGTGGATCG ATCCGGCGAA CAGGCCACAG GGCGGGCTGG AGCTCGGCTT CCCGCGCAAT
GTGTGA
 
Protein sequence
MAGGPLSQMA QPTVLFERNT SLSDVTIRHW VDGQPFNGTG TRWAEVTNPA TGHVTGRVAL 
ASSKDADHVI AVAARAAGSW SRTSLAQRTR ILFAFRELLD SRRDELAAII TAEHGKLPSD
ALGEIARGQE VVEFACGVSH LLKGGHSESV STGVDVHSKR DPLGVVGIIS PFNFPAMVPM
WFFPLAIATG NTVVLKPSEK DPTAALWIAD LWKQAGLPDG IFNVLQGDKE AVDALIESPV
VQSISFVGST PVAQYVYEAS SRHGKRVQAL GGAKNHMIVL PDADLDLAAD AAVNAGYGSA
GQRCMAVSVL VAVGEIADDL VARIADRTRT LVVGDGAEAD MGPLITRAHR DRVASFVDAG
EQDGAAIEVD GRDVQKGGIQ DGFWLGPTLL DHVTPAMKVY QEEIFGPVLC VVRVNTYDEA
VVLVNGNPYG NGAALFTNDG GAARRFEADV QVGMIGVNIP VPVPVAYYSF GGWKQSLMGD
THAHGTEGVQ FFTRGKVVTT RWIDPANRPQ GGLELGFPRN V