Gene Franean1_3977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3977 
Symbol 
ID5672338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4762660 
End bp4764357 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content73% 
IMG OID641242856 
Producthypothetical protein 
Protein accessionYP_001508273 
Protein GI158315765 
COG category 
COG ID 
TIGRFAM ID[TIGR02677] conserved hypothetical protein TIGR02677 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.21248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.150808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGGGG TCGCGGCGTC GGAGCGGGCA GGGGGCCGCG ACCGGCCCGC GAACGTCTCT 
CGGTTCGGGC CCTTCGCCCA TGTGACAGTG GAGAAGGCCC CGCTGTACCG CGCCATCATG
CGCACGTTCG TCCGGGCGAA GCGCAGGTTC AACGTCCACC TGCGCCCCGA GGACGTCCTG
GAAATGCTCG CCCGCGAGCC GGGCCAGGAC GGGGACGGGC TGGACGGCCT GGACCTCGGC
GGCATCACGG ATGTGAGCGC CGAGCTACGC AGCCTGGTCG ACTGGGGCAA CCTGCGCGGC
GATCCGGACA CGAGCCGGGT CACCTCCGTC GAGGACTTCA ACCGCCCCCG ATTCCTCTAC
CAGCTCACGC CCGCCGGCGA GGCCACCGAG ACCGCGCTGG AGGCTTTTGA CGAGGCGTTG
GGCCGGCGCG GGGAGCTGCA GGCCGTCGCC CTGTCCGACA TCCACAGCCA GCTGCGGGCG
CTGCTAGCCC TGCTGCCGGA ACCGGAGATC GATGCGGCGA AGGCGCACCA GCTGCTGCGT
GACCTGGCCG GGGTCTTTCG CGGGCTGGCG GACAACGCCC AGGCGTTCAT GGGGTCGCTG
CAGCGCACCA TCGATCTGCA CGACGCCGAC CTCGACGTCT TCCTGGCCTA CAAGGAACAG
CTCATCGACT ATCTCCAGCG GTTCATCCGC GACCTCGTCG TCACCTCGGC GCGGATCGTC
GACGTCCTGC GGGAGATCGA GGAGCACGAC CTGTCCCGCC TGCTGCGGGC GGTCGCCGAG
CGCGAGGCCC GCGACGCGGC ACCCGACCTC GACCACCCGC CCCCGCCGAC ACCGTCGCAG
CCGGCTCTGA CGTCGCAGCC GGAGCTGACG CAGGGGGAGC CGGCCGAGCC GCTGGCGAGC
GGGGTCGTCG GCGATCGGCT GGCGTTGTGG ACGGAGCGGT GGGAGGGGTT ACGGCACTGG
TTCGTCGGCG ATCGGACGCA TCCGCCTCAG TCACGCCTCC TGCGGGAGCG GGCTCGGGCC
GCCGTGCCGG CGCTGCTGTC GGTGGTGCTG GTCCTCAACG AGCGCAGGTC GGGGCGCAGC
GACCGGTCGG CGGACTTCCG TGCGCTCGCG CTCTGGTTCG CGCAGGCCCC GAACGACGAC
GACGCCCACC GGCTATGGCG GGTGTGCTTC GGGCTTGCTC CGGCCCGCCA CCTCACGGTC
GACGCGGCGA CCGTGGAGGC CCGTGACGCC GAGCCGGTGC CGGCGTCGAC GCCGTGGAGC
GAGGCGCCTC CGATTCTCGT CGACATCCGG CTGCGGCGCA CCGGGCGGTA CGAACGCCGG
GGCGCACCCA ACCGCGTCCG CGACCGCGCG GCCGAGCGCC GGCTGCTGGC CGAGAGACTG
GCGGCCGAGG AGGATGTGGT GCGGGCCGCG CGCCGCCAGC TCGCCACCGG CGAACCCGTG
CGCCTGTCCG AGCTGGCCGA GCTCGACGAC CCGACCTTCC GACTGTTCCT GACCGTCCTC
GGGGACGCCC TCGCCCGCCG CCGGCCGGGG GAACTGAGCG TCGTGACCAG CAGCGCCGAC
GGCACTCTGT CCATCTCGCT CACGCCCACC GACGACGGTG TGATCGCGAC CGTGCCCACC
AGCGCGGGCC TGTTCACCGG CCCCGACCAT CTCCTGGAGA TCCACGACTT GCTGACAGCG
ACGGCGGTGA CGGCGTGA
 
Protein sequence
MAGVAASERA GGRDRPANVS RFGPFAHVTV EKAPLYRAIM RTFVRAKRRF NVHLRPEDVL 
EMLAREPGQD GDGLDGLDLG GITDVSAELR SLVDWGNLRG DPDTSRVTSV EDFNRPRFLY
QLTPAGEATE TALEAFDEAL GRRGELQAVA LSDIHSQLRA LLALLPEPEI DAAKAHQLLR
DLAGVFRGLA DNAQAFMGSL QRTIDLHDAD LDVFLAYKEQ LIDYLQRFIR DLVVTSARIV
DVLREIEEHD LSRLLRAVAE REARDAAPDL DHPPPPTPSQ PALTSQPELT QGEPAEPLAS
GVVGDRLALW TERWEGLRHW FVGDRTHPPQ SRLLRERARA AVPALLSVVL VLNERRSGRS
DRSADFRALA LWFAQAPNDD DAHRLWRVCF GLAPARHLTV DAATVEARDA EPVPASTPWS
EAPPILVDIR LRRTGRYERR GAPNRVRDRA AERRLLAERL AAEEDVVRAA RRQLATGEPV
RLSELAELDD PTFRLFLTVL GDALARRRPG ELSVVTSSAD GTLSISLTPT DDGVIATVPT
SAGLFTGPDH LLEIHDLLTA TAVTA