Gene Franean1_6526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6526 
Symbol 
ID5674841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7933863 
End bp7935599 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content75% 
IMG OID641245374 
Producthypothetical protein 
Protein accessionYP_001510769 
Protein GI158318261 
COG category 
COG ID 
TIGRFAM ID[TIGR02231] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.122318 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTCCA CGGCGGACGG CACACCTGAC CTTCCTGACG CACCGGCGGG CGCACCGACG 
GGCGCGCCGC CGTCGGTGCT CCTCGACGCG CCGATCAGCG CCGTGGTCGT CTATCCCACC
GCCGCGCGCG TGACCCGCCG TGGCCGGTTC GATGTCGCGA CGCTCGGCTC GCCGTCCGAG
CGGGTCGAGG TGACTCTCAG CGGGCTGCCG CTCGAGCTCG ACGAGGACTC GGTGCGGGTC
AGCGGCACCG GCTCCGCGCG GGTGCTCGGG GTGCGGGTCG TCTCCCAGGC CCGGGCTACC
CCCGACGCCG GTGCCCTCGC CGACCTGCGC GCCCGCCAGG CCGAGCTGCG CGCGCACCTG
GCCGAGATCG GCGACGAGGA CGAGACCGAG CGGGCCCGTC GCTCCTTCCT CGACGTGGCC
GCCCGGGTGG GAGCCGGCGC CCTCGCGCGC GGCTGGGCAC ACGAAGGCAC CGACCGCGAG
GGCGACGGGC CCGGGGACGC GGCAGCCCGG CTGGTCACTG TCGGCGACGC GTTCGCGGCG
CAGATGTCGG CCGTCCACGC GCGCCGGCGG GCGCTCGCCG AGCGGCGGAA GGAGACGGAA
CGGGAGCTGA CCGTGCTCGG CCGGGTCATC GAGGCCCGCC ACGCGCGGCC CGAGCCGGAC
ACCCGCGCGA TCGTCGTCGA CCTCGAGCCG CCGACGGCCA CCGAAGGCCC GGCGGAGGGC
GTGCTGGTCG ACAGGACAGC CTCGCTGGTG GACCTGGAGG TCTCCTACCT GGTCCGTTCC
GCCTCCTGGA GATCCGGCTA CGACGCACGG CTGGACGGCG AGCGGGTCAC CCTGACCTGG
TTCGCGATGA TCAGTCAGCG CACCGGCGAG GACTGGCCCG TGACGGACCT GCGGCTGTCC
ACGGCCCGCC CGTCCAGCGG CGTCGACCTG CCCGAGCTGA GCCCGCAGTA TGTGGACATC
GCCCGCCCGC GGGTCCTTCC CCGCGGGCGG GCCAAGGCGT CCGGCGAAGG CAGGGGCGGT
GGCGGCGACG GCATGACGAC CTTCATGCCG GCGGCGATGG CGGCGCCGGC ACAGGCCCCG
CCGCCGATGG CTGCCGCCGA GGCCACGCTG GAGTCGGCCG GCCCCGCGTC GACCTACCGC
CCGCCGCGGC CGGTGGCGGT ACCCGCGGAC GGGGACCCGC ACCGCACCAC CGTGGCAGTG
ATCGAGCTCG ACGCCGTTCT CGACCACGTC ACCGTGCCCA AGCTCGCCGC CGAGGCGCTG
TTGCGTGCCG CGGTCGTGAA CACCTCGTCG CACACGTTGC TGCCAGGTAA GGCGTCGGTC
TTCCACGGCC CGGAGTTCGT CGGCACCACG CGCCTCGAGC TCGTCCCGCC CGGCGGGGAG
ATGGAGCTTC GGCTCGGGGT GGACGACCGC ATCAGGGTGG AGCGCGAGCT GGTCAGCCGG
GTCACCGGCC GGCGGGTGGT CGGCAACACC CGGCGGACGG ACGTCGTCCA CCGCACCACC
GTCACCAACC ACGCGCCGAT GCGGGCCCGG GTGACCGTGC GGGACCAGGT GCCGGTCTCC
CGGCACGAGA ACATCCAGGT CAGGGAGGTC GTGGCCGCTC CCGCCGCCAC CGAGCACACC
GATCTGGGGC TGCTTACCTG GGAGCTGGAG CTCGAGCCGG GCTCGAGCCG GGAGATCACG
CTGTCCTACC GGCTCGAGCA CCCTCGCGGC GTGGAGATCA CCGGCTGGGG CGACTGA
 
Protein sequence
MMSTADGTPD LPDAPAGAPT GAPPSVLLDA PISAVVVYPT AARVTRRGRF DVATLGSPSE 
RVEVTLSGLP LELDEDSVRV SGTGSARVLG VRVVSQARAT PDAGALADLR ARQAELRAHL
AEIGDEDETE RARRSFLDVA ARVGAGALAR GWAHEGTDRE GDGPGDAAAR LVTVGDAFAA
QMSAVHARRR ALAERRKETE RELTVLGRVI EARHARPEPD TRAIVVDLEP PTATEGPAEG
VLVDRTASLV DLEVSYLVRS ASWRSGYDAR LDGERVTLTW FAMISQRTGE DWPVTDLRLS
TARPSSGVDL PELSPQYVDI ARPRVLPRGR AKASGEGRGG GGDGMTTFMP AAMAAPAQAP
PPMAAAEATL ESAGPASTYR PPRPVAVPAD GDPHRTTVAV IELDAVLDHV TVPKLAAEAL
LRAAVVNTSS HTLLPGKASV FHGPEFVGTT RLELVPPGGE MELRLGVDDR IRVERELVSR
VTGRRVVGNT RRTDVVHRTT VTNHAPMRAR VTVRDQVPVS RHENIQVREV VAAPAATEHT
DLGLLTWELE LEPGSSREIT LSYRLEHPRG VEITGWGD