Gene Franean1_3495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3495 
Symbol 
ID5671866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4158467 
End bp4159975 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content72% 
IMG OID641242383 
Product2-methylcitrate dehydratase 
Protein accessionYP_001507803 
Protein GI158315295 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.375848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACC ACAAGGTCCG CGTGTATCGG AGCGCCGAGC GGCTGGCCCG CGAGGACCAG 
CTCGCCTGGA AGATCGCCGC GGTCGCCACC GACCCGGTCG AGGTCACCGC CGACGTGACC
GACATGGTGA TCAACCGGGT CATCGACAAC GCCGGGGTGG CCGCCGCGTC GCTGACCCGA
AAGCCGGTGG CCGCCGCCCG CGACCAGGCG CTGCGGCACA TCCCCGCCCC CGGGCGTGAC
GGCGCTACCG TGTTCGGCAC GCCGGCCGGC GTCCGCGTCT CCCCGGAGTG GGCAGCCTGG
GCGAACGGCG TCGCCGTGCG CGAGCTGGAC TTCCACGACA CCTACCTGGC GGCGGACTAC
TCCCACCCCG GGGACAACAT CCCGCCGGTG CTGGCCGTCG CCCAGCATCT GGGGCTGGAC
GGTGCCGCCC TGCTGCGCGG GATCGCCACC GGCTACGAGA TCCAGGTCGC GCTCGTCCGT
GGGATCTGCC TGCACCGGCA CAAGATCGAC CACATCGCGC ACCTCGGCCC GTCCGCCGCC
GCCGGCATCG GCGCCCTGCT CGGCCTGCCG ACCGAGACCG TCCACCAGGC TGTCGGACAG
GCCCTGCACA CGACCACAAC AACCCGGCAG TCACGCAAGG GCGAGATCTC GAGCTGGAAG
GCGTACGCAC CGGCGTTCGC CGGCAAGCTC GCCGTCGAGG CGGTGGACCG GGCCATGCGC
GGTGAGGGCG CGCCGTCACC GGCCTACGAG GGTGAGGACG GGTTCATCGC CTGGCTGCTC
GACGGGCCCG GCGGGCAGTA CGTCGTCTCG CTGCCCGCGG CCGGGGAGGC CAAGCGGGGG
ATCCTCGAGA CCTACACCAA GGAGCACTCC GCCGAGTACC AGAGCCAGGC GCTGATCGAC
CTGGCCCGCC GCCTCGGCCC GCGGATCGGG GACTTTGCCC GAGTCCGCTC CATTGCGATC
CACACCAGCC ATCACACCCA CTACGTGATC GGCTCCGGGG CGAACGACCC GCAGAAGTAC
GACCCGAAGG CCAGCAGGGA GACCCTCGAC CACTCGATTC CCTACATCTT CGCGGTCGCG
TTGCAGGACG GCGACTGGCA CCACGAGCGC TCCTACGCCC CCGAGCGGGC GACGCGGCCG
GACACCGTCG CCCTCTGGCG GAAGATCACC ACGCTGGAGG ACAAGGAGTG GACGCGGCGT
TACCACGCCA CCGACCCCGC CGAGAAGGCG TTCGGTGGCC GGGTCGTCGT CGAGCTCGAC
GACGACACCG TGCTCACCGA CGAGATCGCC GTCGCCGACG CGCATCCGCT CGGCGCCCGT
CCGTTCGGCC GGGACGAGTA CGTCGGCAAG TTCCGCCGCC TCGCCGAGGG CGTCATCCCC
GGCCCCGAGC AGGACAGGTT CCTCGACACC GCCGCCCGCC TGCCCGAACT GACCCCGGAC
GAACTCGCCG GCCTCACCCT CACTCCCGAG CCCGCACTCA CCACCGGGAG CACCCAGGGG
ATCTTCTGA
 
Protein sequence
MIDHKVRVYR SAERLAREDQ LAWKIAAVAT DPVEVTADVT DMVINRVIDN AGVAAASLTR 
KPVAAARDQA LRHIPAPGRD GATVFGTPAG VRVSPEWAAW ANGVAVRELD FHDTYLAADY
SHPGDNIPPV LAVAQHLGLD GAALLRGIAT GYEIQVALVR GICLHRHKID HIAHLGPSAA
AGIGALLGLP TETVHQAVGQ ALHTTTTTRQ SRKGEISSWK AYAPAFAGKL AVEAVDRAMR
GEGAPSPAYE GEDGFIAWLL DGPGGQYVVS LPAAGEAKRG ILETYTKEHS AEYQSQALID
LARRLGPRIG DFARVRSIAI HTSHHTHYVI GSGANDPQKY DPKASRETLD HSIPYIFAVA
LQDGDWHHER SYAPERATRP DTVALWRKIT TLEDKEWTRR YHATDPAEKA FGGRVVVELD
DDTVLTDEIA VADAHPLGAR PFGRDEYVGK FRRLAEGVIP GPEQDRFLDT AARLPELTPD
ELAGLTLTPE PALTTGSTQG IF