Gene Franean1_2409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2409 
Symbol 
ID5670805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2862760 
End bp2864148 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content72% 
IMG OID641241326 
Producttransposase IS4 family protein 
Protein accessionYP_001506747 
Protein GI158314239 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.795016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCAGA GTACCGGTGT CTACCCGTCT GTCCGCATCG ACGCCGCCGG GCGGGGTGTC 
GTGTCGGCGG CGGGCAGCGT GCTGCTGACC GAGACGGTCC GCGTCAGCGG GCTGGGTCGG
TTGCTGTCCG ACGCGCTCGC GCCGTGGCGG TCGCCGCTGG CGACGCATGA CCCGGCGAAG
GTACTGACCG ATCTGGCCGT GGCGCTGGCG TTGGGCGGGG ACTGCCTGGC CGATATCACG
TTGCTGCGCG CGGAGCCGGC ACTGTTCGGG CTGGTGGCCT CGGACCCGAC GGTGTCACGG
ACGATCGACC GGCTCGCCGG CGACGCCGAG AAGGTCCTGA CGGCACTCGA CCGGGCCCGC
GCGGCAGCGC GGGGCCAGGT ATGGGCGGCG GCCGGGGAAC ACGCCCCCGA CCACACGGTG
TCGGTGCACG ACCCGCTGAT CGTGGATCTC GACGCGACAC TGGTCACCGC CCATTCCGAG
AAACAAGACG CGGCGCCGAC ATTCAAACGC GGCTTCGGGT TCCACCCGTT GTGGGCGTTC
ATCGACCACG GCCAGGCCGG CACCGGGGAA CCCGCGACGG TCCTGCTGCG CCCCGGTAAC
GCCGGGTCGA ACACCGCCGC CGACCACATC ACCGTCGCCA CCGGCGCCCT CGCCCAGCTC
CCGGCCCGGC TACGCCGCTC CCGCAAGGTA CTGATCCGCG CCGACTCCGC CGGTGGGACC
CACGCCTTCC TCGCCTGGGC GCACCAGCGC AGGCTGGCCT ACTCGGTCGG GTTCACCCTG
CCCGACAACG CCACCCAGCT AATCGCCCGC CTCCCGAAGA AGGCGTGGAC ACCGGCCTAC
GACGCCGACC GCCAGCCCCG AACCGGCGCT TTCGTCGCCG AACTGTCCGG CCTGATGGAC
CTGACCGGCT GGCCACCGGG CATGCGTGTC CTCGTCCGCA AGGAACGCCC GCATCCCGGC
GCGCAGCTGC GGATCACTGA CGTCGACGGC AACCGGATCA CCGCGTTCGC GACCAACAGC
GTCCGCGGGC AGCTCGCCGA CCTCGAACTA CGCCACCGCC GCCGGGCCCG CTGCGAAGAC
CGCATCCGAG CCGCCAAGGA CACCGGCCTG ACCAACCTGC CTCTGCACGA CCTCGACCAG
AACAGGGTCT GGTGCCAGAT CGTCGCGCTC GCCTGCGACC TGATCGCCTG GACACAGATG
CTCGCCCTCG CAGACCAGCC AGCCCGACGC TGGGAACCGA AACGGCTACG CCACAGCCTG
TTCTGGCTCG CGGGACGGCT CGCCCACCAC GCCCGCCAGA GCGTGCTCCA CCTCGCCGCC
CACCACCCCT GGACACCACT CGCCATCCAG GCGATCACCA CGCTGCGCGC CCTACCCGCT
CCCGGATAG
 
Protein sequence
MVQSTGVYPS VRIDAAGRGV VSAAGSVLLT ETVRVSGLGR LLSDALAPWR SPLATHDPAK 
VLTDLAVALA LGGDCLADIT LLRAEPALFG LVASDPTVSR TIDRLAGDAE KVLTALDRAR
AAARGQVWAA AGEHAPDHTV SVHDPLIVDL DATLVTAHSE KQDAAPTFKR GFGFHPLWAF
IDHGQAGTGE PATVLLRPGN AGSNTAADHI TVATGALAQL PARLRRSRKV LIRADSAGGT
HAFLAWAHQR RLAYSVGFTL PDNATQLIAR LPKKAWTPAY DADRQPRTGA FVAELSGLMD
LTGWPPGMRV LVRKERPHPG AQLRITDVDG NRITAFATNS VRGQLADLEL RHRRRARCED
RIRAAKDTGL TNLPLHDLDQ NRVWCQIVAL ACDLIAWTQM LALADQPARR WEPKRLRHSL
FWLAGRLAHH ARQSVLHLAA HHPWTPLAIQ AITTLRALPA PG