Gene Franean1_4250 details


Gene Information

Locus tag: Franean1_4250
Symbol:
ID: 5672605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5063699
End bp: 5065603
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 68%
IMG OID: 641243123
Product: hypothetical protein
Protein accession: YP_001508540
Protein GI: 158316032
COG category:
COG ID:
TIGRFAM ID:
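
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 5063699
end_bp = 5065603
gene_length_bp = 1905
protein_length_aa = 634


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # coordinate span vs. Gene Length
print(computed_protein_length == protein_length_aa)  # codon count vs. Protein Length
```

Under these assumptions the coordinate span gives 1905 bp, and 1905 / 3 - 1 gives 634 codons, matching the listed gene and protein lengths.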


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTACAGCG GTTCCGCATG GCTAGTGAGT ACTCGGAAGG GCGAGATCGT CCGGGTCAAC 
GGCCTGGCGG GTCGGGTTGA TGCCGCTCCG GTCGGGATCA CTCGCCCGGG AGACGCCGTT
CAGGTCGTCC AGACGGACGA TCTTGTTCTC GTAGCCGCGA ACGCCTACCT GTCCAGCATC
GATCCACGGC TGTTGGCCCC GACCCGGGGC GCAAGCCTGC AATCGGCCGG GGCCAGGATT
CTCGCCACAG GGAAAAGGGC ATATGTGCTC GATCCTGCCA GCGAGACTGT CTACCGGCTT
GATCCTCGGA CACTGCGACC CGCCGGTCCG ATGGTTGCCC TGCCTGGCCG CGCGGGTGAC
ACGGCGCTCC AGGAAGGCGG TGAACGGCTC TGGGTGGCCC TGCCCGACCT GGGCGCCCTG
ACGCTCGTCG AGAACGACGT GGCCGGTATG CCAGTGCCGG CCGGAGCACC CGGGCCGCAT
CTGGTGTTCG CCCGGGTGGC CGGCCAGACC TGGGTTCTTA ATGGCAACGA CGGGACCGCG
GGACAGCTGC GGGCCGACGG CGCCACGAGC CGCCGAATAC AGCTCGGCGC CGACTACGAG
GGCACCGCGC TGCTTCCCGC CGCGGGGGAC AGCCCGCTGC TGGTTTTCGC CCTGCCAGGT
TCACGCCGGC TCGCGGTGCT CGAACCCGGT CGCGAACAGC CTCGGATCGA GAGCATTCCG
GTCGCGGTCG GGCCGCTGGG CGCACCACTC GTCGCCGACG GGCTGGTCTA CGTCCCGGAT
GAAGGCAGCG GACGCCTCGC TGTATACGAC CTGGCACGCC ATGAGTTCAA ATCGCCCATC
TCGGTGACGG CGACCGCCGC CACCGACCTC GAGCTCTTCC GGGCCGGCGG GATGGTGTGG
GCCAATGCCA TGTCTACGCC GGATGCGGTG GCAGTGTATG ACGGAGTCGT CCACCGAATC
GTCAAGTACG GGGAACCGGC GGCACCCCCG ATCGCTCCGT CGCCGACCAG AACACCTGCC
GCCACCGCTC CGCCGAATCC GAATCCGAAT CCGAAAACAC CGGCCAGCGC GATCCCCACG
CGTTCGGTGC CTGGTACGCC GTCAGCGGGG CCGGCTGCGG GCGGCGCCGG CCCGTCATCC
GCAACACCTC CAGCCCGGGC CACCTCACCC GCCCGGGGTG GCGACGCCGG GCCCTCCGGT
GAGGACGGAG CGGACGACGC GGCCACCGGC CCGCCCGAGA GAGTCCCCGA CCTCACCGGC
GACGACTCGA CGGACTCGGC GCGCCGGCCG GGCGAGAACG GTGGCATTCC GATCACTCGT
CGGAATGGGC CGTATACGCA CCAGACCGCG TTTGGTCGGG TCGTCGTCGA GGCCGGAAAC
GGCTACGTGG ATGTTTCGTG GGAGCTTCCG GCTGGCGAGG GCGGCCCAAC CGACCTCGCG
TGGATGGCCG GTGCCCGGGG TGGCGGTGGT GCCGGCAACG GCGGGCCGCT GCCGCCCGGC
AGCACCTCAA CGCGCTTCAA CGTCACCTAC ACCGGCGACA CGCCGACCCT GACTTTCTCA
TCGGCCGCGA ACACCTTCTC CTTTGATATC CGGGCCTGGG AGCTGTGTGA CTTCTGCAAC
TACGGACACC CCACCTATAC GGTGCCGCTT CGCGACGCCC CACACGGTAC CCCGATCGGG
ACGGCGCTTC CTCCGGTACC GGACGGGCAG ATGGGAAATC AGGTCGAACT GCACTGCGTG
GCGGAATCCA CCGTCGAATA CGCTGACCCG GTCTATCCCG GTCGGGTTGG CACATTCGCC
TGGTACAAGA TTACCTATCA GGGGGCCACG GGATATGTGC CGACGAACTA TATCAGCATT
CCCGACACTG GGATGGAACC ACGATACGCG GTGCGTCCCT GCTGA
 
Protein sequence
MYSGSAWLVS TRKGEIVRVN GLAGRVDAAP VGITRPGDAV QVVQTDDLVL VAANAYLSSI 
DPRLLAPTRG ASLQSAGARI LATGKRAYVL DPASETVYRL DPRTLRPAGP MVALPGRAGD
TALQEGGERL WVALPDLGAL TLVENDVAGM PVPAGAPGPH LVFARVAGQT WVLNGNDGTA
GQLRADGATS RRIQLGADYE GTALLPAAGD SPLLVFALPG SRRLAVLEPG REQPRIESIP
VAVGPLGAPL VADGLVYVPD EGSGRLAVYD LARHEFKSPI SVTATAATDL ELFRAGGMVW
ANAMSTPDAV AVYDGVVHRI VKYGEPAAPP IAPSPTRTPA ATAPPNPNPN PKTPASAIPT
RSVPGTPSAG PAAGGAGPSS ATPPARATSP ARGGDAGPSG EDGADDAATG PPERVPDLTG
DDSTDSARRP GENGGIPITR RNGPYTHQTA FGRVVVEAGN GYVDVSWELP AGEGGPTDLA
WMAGARGGGG AGNGGPLPPG STSTRFNVTY TGDTPTLTFS SAANTFSFDI RAWELCDFCN
YGHPTYTVPL RDAPHGTPIG TALPPVPDGQ MGNQVELHCV AESTVEYADP VYPGRVGTFA
WYKITYQGAT GYVPTNYISI PDTGMEPRYA VRPC
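
The sequences above are displayed in blocks of ten letters, so the spaces and line breaks have to be removed before any computation. The sketch below is one possible way to do this and to compute a GC percentage comparable to the "GC content" field. It is not part of the original record, the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for the 10-letter block display."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "GTGTACAGCG GTTCCGCATG GCTAGTGAGT ACTCGGAAGG GCGAGATCGT CCGGGTCAAC"
cleaned = clean_sequence(first_line)

print(len(cleaned))         # number of bases on the first line
print(cleaned[:3])          # first codon of the CDS
print(gc_percent(cleaned))  # GC percentage of this line only, not the whole gene
```

To compare against the listed GC content, pass the full gene sequence through `clean_sequence` before calling `gc_percent`. The gene starts with GTG while the protein starts with M, which is consistent with GTG being an accepted start codon under translation table 11.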