Gene Franean1_2869 details


Gene Information

Locus tag: Franean1_2869
Symbol:
ID: 5671258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3384116
End bp: 3385504
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 73%
IMG OID: 641241778
Product: transposase IS4 family protein
Protein accession: YP_001507198
Protein GI: 158314690
COG category:
COG ID:
TIGRFAM ID:
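
The coordinates and lengths listed above agree with each other if the start and end positions are read as 1-based and inclusive, which is an assumption about how this record counts positions. A minimal Python sketch of that consistency check follows; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3384116, 3385504)
print(length)                     # 1389
print(protein_length_aa(length))  # 462
```

Both values match the Gene Length and Protein Length fields of this record.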


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.338684
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCAGA GTACCGGTGT TTACCCCTCT GTCCGCATCG ACGCCGCCGG GCGGGGTGTC 
GTGTCGGCGG CGGGCAGCGT GCTGCTGACC GAGACGGTCC GCGTCAGCGG GCTGGGTCGG
TTGCTGTCCG ACGCGCTCGC GCCGTGGCGG TCACCGCTGG CGACCCATGA CCCGGCGAAG
GTACTGACCG ATCTGGCCGT GGCGCTGGCG TTGGGCGGGG ACTGCCTGGC CGACATCGCC
CAGCTACGCG CGGAGCCGGA GTTGTTCGGG CTGGTGGCCT CGGACCCGAC GGTGTCACGC
ACGATCGACC GGCTCGCCGG CGACGCCGAG AAGGTCCTGA CGGCACTCGA CCGGGCCCGC
GCGGCAGCGC GGGGCCAGGT ATGGGCGGCG GCCGGGGAAC ACGCCCCCGA CCACACGGTG
TCGGTGCACG ACCCGCTGAT CGTGGATCTC GACGCGACAC TGGTCACCGC CCATTCCGAG
AAACAAGACG CGGCGCCGAC ATTCAAACGC GGCTTCGGGT TCCACCCGTT GTGGGCGTTC
ATCGACCACG GCCAGGCCGG CACCGGGGAA CCCGCGACGG TCCTGCTGCG CCCCGGTAAC
GCCGGGTCGA ACACCGCCGC CGACCACATC ACCGTCGCCA CCGGCGCCCT CGCCCAGCTC
CCGGCCCGGC TACGCCGCTC CCGCAAGGTA CTGATCCGCG CCGACTCCGC CGGTGGGACC
CACGCCTTCC TCGCCTGGGC GCACCAGCGC AGGCTGGCCT ACTCGGTCGG GTTCACCCTG
CCCGACAACG CCGCCCAGCT GATCGGCGAG ATCCCCGCGA AGGCGTGGAC ACCGGCCTAC
GACGCCGACC GCCAGCCCCG GCCCGGCGCG TTCGTCGCCG AACTGTCCGG GCTGATGGAC
CTGACCGGCT GGCCACCGGG CATGCGTGTC CTCGTCCGCA AGGAACGCCC GCACCCCGGC
GCGCAGCTGC GTGTCACCGA CGTCGACGGC AACCGGATCA CCGCGTTCGC GACCAACGCC
GCCCGCGGGC AGCTCGCCGA CCTCGAACTA CGCCACCGCC GCCGGGCCCG CTGCGAGGAC
CGTATCCGAA CCGCCAAGGA CACCGGTCTG ACCAACCTGC CGCTGCACGA CTTCACCCAG
AACCAGATCT GGTGCGCCCT TGTCGCACTC GCTCTCGATC TGATCGCCTG GACCCAGATG
CTCGCCCTCG CCGGCCAGCC CGCGCGACGC TGGGAACCGA AACGGCTACG CCACCGCCTG
TTCTGGCTCG CAGGCCGGCT CGCCCACCAC GCCCGCCGCA GCACCCTGCA CCTGCCCCCG
CACCACCCCT GGGCCCCCCT CGCCCTCCAG GCGATCACCA CACTGCGCGC ACTGCCCGCC
CCCGGATAG
 
Protein sequence
MVQSTGVYPS VRIDAAGRGV VSAAGSVLLT ETVRVSGLGR LLSDALAPWR SPLATHDPAK 
VLTDLAVALA LGGDCLADIA QLRAEPELFG LVASDPTVSR TIDRLAGDAE KVLTALDRAR
AAARGQVWAA AGEHAPDHTV SVHDPLIVDL DATLVTAHSE KQDAAPTFKR GFGFHPLWAF
IDHGQAGTGE PATVLLRPGN AGSNTAADHI TVATGALAQL PARLRRSRKV LIRADSAGGT
HAFLAWAHQR RLAYSVGFTL PDNAAQLIGE IPAKAWTPAY DADRQPRPGA FVAELSGLMD
LTGWPPGMRV LVRKERPHPG AQLRVTDVDG NRITAFATNA ARGQLADLEL RHRRRARCED
RIRTAKDTGL TNLPLHDFTQ NQIWCALVAL ALDLIAWTQM LALAGQPARR WEPKRLRHRL
FWLAGRLAHH ARRSTLHLPP HHPWAPLALQ AITTLRALPA PG
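
The gene and protein sequences above can be cross-checked by translating the nucleotide blocks. The sketch below is a minimal, standard-library-only illustration. It assumes the sequence is given on the coding strand in frame 1, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate frame 1 of a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
first_block = "ATGGTGCAGA GTACCGGTGT TTACCCCTCT"
print(translate(first_block))  # MVQSTGVYPS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.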