Gene Franean1_4583 details


Gene Information

Locus tag: Franean1_4583
Symbol:
ID: 5672930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5465574
End bp: 5466689
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 75%
IMG OID: 641243446
Product: acyl-CoA dehydrogenase domain-containing protein
Protein accession: YP_001508862
Protein GI: 158316354
COG category: [I] Lipid transport and metabolism
COG ID: [COG1960] Acyl-CoA dehydrogenases
TIGRFAM ID:
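
The start, end, and length fields above are related by simple arithmetic, sketched here in Python. The sketch assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5465574, 5466689)
print(length)                     # 1116
print(protein_length_aa(length))  # 371
```

Under these assumptions the result matches the Gene Length and Protein Length values listed in the record.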


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.277909
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.943502
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTCA GCTTCACCGC CGAGCACGAG CAGTTGCGGC GGGCCGTCCG CGATTTCCTG 
GCGGAGCGGT CGCCGGCGGC CGCCGTCCGG CGACTGATGG ACTCGACCGC CAACCGCGAC
GACGACGTGT GGACCCGGCT GGCCGGGGAG CTCGGCCTGG TGGGCCTCGG CCTGCCCGAG
CAGTACGGCG GCTCCGGCTT CGGCGAGATC GAGGTCGGGA TCGTGCTGGA GGAGATGGGC
CGGGCGCTGC TCGTGGCCCC CTACCTGAGC ACCGCCGTGC TCGCGGGTCA GACCCTGGCG
ACGTCCGAGG ACCGGGAGGC GCAGCAGCGG TGGCTGCCGG GGATCGCGGC CGGCTCGCTG
ACCGCGACGC TGGCGGTCGC CGACGAGTCG GGATCGTGGG AGCTGACCGA CCCGGCGACG
AGCGCCGAGC CGCGCGGCGG GCAGTGGCTG GTCTCCGGCC CGAACCACTA CGTCCTCGAC
GGCCACAGCG CCGACCTGCT CCTGGTGGTG GCGCGGGCCG CGGACGGGAC GGGGGTGTTC
GCGGTCGAGG GGACGGGCCC CGGGGCCGTG CGCGCCAGGC TCGACAGCCT CGACCCCACC
CGGGACCTCG CCTCGGTCGT CCTACGGGAG GCGCCCGCGG TCCGGGTCGG CGCGGGCCGG
GAGGCGACAA CGTGGCTGGG CGAGGTCCAC GACCGGGCAC TCGCGGCCCT GGCGTCGGAA
CAGGTCGGCG GCGCGGCGCG GTGCCTGGAG CTCGCCGTCG ACTACGCGAA GGTACGGGAG
CAGTTCGGCC GGCCGATCGG CTCGTTCCAG GCGATCAAGC ACAAGTGCGC CGCCCTGCTC
GTCGAGGTCG AGAGCGCCCG CTCGGCGGTC TACCACGCCA ACGCCGCACT GGCCGCCGAC
GACCCGGAGG GGACGGTCGC CGCGGCGGTC GCGTCGGCGT ACGCATCGCG GGCGTTCACC
CTCGCCGCCA AGGAGTGCAT CCAGATCCAC GGCGGCATCG GGTTCACGTG GGAGCACGAC
GCGCACCTCT TCCTGCGGCG GGCGAAGTCC TCCGAGCTGT TCTTCGGGGC ACCGCGGGCG
CAGCGCAATC GGCTCGCGGA CCTGGTCGGG ATCTGA
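
The gene sequence above is printed in space-separated blocks of 10 bases. A minimal Python sketch for turning such a listing back into a contiguous string and estimating its GC content follows. The helper names are hypothetical, and the record does not state how its GC content figure was computed or rounded, so this is only one plausible way to derive such a value.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, used as a short demonstration.
first_line = "ATGAGGCTCA GCTTCACCGC CGAGCACGAG CAGTTGCGGC GGGCCGTCCG CGATTTCCTG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this 60-base fragment only
```

Passing the whole listing to clean_sequence and then to gc_percent would give a figure to compare against the GC content field.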
 
Protein sequence
MRLSFTAEHE QLRRAVRDFL AERSPAAAVR RLMDSTANRD DDVWTRLAGE LGLVGLGLPE 
QYGGSGFGEI EVGIVLEEMG RALLVAPYLS TAVLAGQTLA TSEDREAQQR WLPGIAAGSL
TATLAVADES GSWELTDPAT SAEPRGGQWL VSGPNHYVLD GHSADLLLVV ARAADGTGVF
AVEGTGPGAV RARLDSLDPT RDLASVVLRE APAVRVGAGR EATTWLGEVH DRALAALASE
QVGGAARCLE LAVDYAKVRE QFGRPIGSFQ AIKHKCAALL VEVESARSAV YHANAALAAD
DPEGTVAAAV ASAYASRAFT LAAKECIQIH GGIGFTWEHD AHLFLRRAKS SELFFGAPRA
QRNRLADLVG I