Gene Franean1_4230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4230 
Symbol 
ID5672585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5037172 
End bp5038443 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content69% 
IMG OID641243103 
Producthypothetical protein 
Protein accessionYP_001508520 
Protein GI158316012 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCGTGA AAGTGCTCCG CGTCGAGGGC ATGCGTGAGG ACACCCGCAG TCCCGAGGAG 
AGAACAGCTC TGGAGTCGGC CGAGCGGCGT CCGGTCGACA TGAGCGTCGA CCGCGTTCTC
CGGATCGCGA GAACCCAGAC CGGCCTGGAC GACTTCGGGC CCATGGACTT CACCGAACGG
CTCGGCCGGT TGCTGGCCGA GGTGGCGGCC GACGACAACG TGTGGCGGGC CCACAAGGCG
ATCTTCGTCG ACCACTGTGT CAAGGCCGCG GTCAACCGCC TGCTCATCCA GGACTACTGG
ACGCGCCACC CCGATGCGCT CGACGTGCGG ATCACGCGGC CGATCAACGT CGTCGCGCTG
CCCCGCTCGG GCAGCACCCA CCTGGAGAAC CTGGTCGCCG CGGACCGCAG GCTCCGTCAC
CTTCCGGTCT ACCTGGCGGC CCAACCCGCA CCGCGACCGA CCGAGACATC CGAGCCTGGC
GCCCCGGACC CCCGCTGGGC CCGTTCTGAG GCGCGATGGC GCAACGTCAG CAAGAACGAG
ATCATGGCCG CGATGCACGA GCACTCGCCC GACCACGCGT GCGGCGAGAA CGAGCTGCAG
CTCCCCGACT TCGCCAGCTA CCAGTGGGAG TGGCTGGCCC AGGTGCCCGG TTTCCGTGAC
CACTACCTGA GCCACGACCA GACACCGCAC TACCGGTACA TGCGCGACGT GCTGAAGACC
ATCGCGCACC AGTTCCCGGA CGAGCGGCGC TGGATGGTCA AGTCGAACCA GCACAGCGAG
CAGCTGGTCC CGCTGCTGGC CGTCTACCCG GACGTCACGG TCGTGATGAT CCACCGGGAT
CCCGTGGCGA CGTTGCAGTC CCTGCTCACC ATGCGCGGCC TCGCGCTCAA GAACAGCCAG
AAGCAGCCGG ACATCGACGC GCACGTCGAG TACTGGGTGA ACCGCGTCGA GCAGATGCTG
CGTAGATATA TGCGCGACCG GGAGCTGGTA CCGAACGGGC AGCTCGTCGA GCTGCAGTTC
GCCGACATCA TCGCCGACGA CGTCCGGTCC GCGACGCACG TGCTCGAGCG GGCCGGCCTG
CCGGTGACCG ACGAGAGCGT CGCCGACATC CGCGCCTACA TAGCCTCCCA CCCGCGCGGA
AAGCGCGGCC GAGTGGTCTA CGACCTGGAA GGCGACTTCG GTCTCGCCGC GGACGAGCTG
CGGGAACGGT TCGCCTTCTA CACCGACGCC TTCGCCGTCG CGGCCGAGGC GGGAAAGGGC
GGCGCCCGGT GA
 
Protein sequence
MAVKVLRVEG MREDTRSPEE RTALESAERR PVDMSVDRVL RIARTQTGLD DFGPMDFTER 
LGRLLAEVAA DDNVWRAHKA IFVDHCVKAA VNRLLIQDYW TRHPDALDVR ITRPINVVAL
PRSGSTHLEN LVAADRRLRH LPVYLAAQPA PRPTETSEPG APDPRWARSE ARWRNVSKNE
IMAAMHEHSP DHACGENELQ LPDFASYQWE WLAQVPGFRD HYLSHDQTPH YRYMRDVLKT
IAHQFPDERR WMVKSNQHSE QLVPLLAVYP DVTVVMIHRD PVATLQSLLT MRGLALKNSQ
KQPDIDAHVE YWVNRVEQML RRYMRDRELV PNGQLVELQF ADIIADDVRS ATHVLERAGL
PVTDESVADI RAYIASHPRG KRGRVVYDLE GDFGLAADEL RERFAFYTDA FAVAAEAGKG
GAR