Gene Franean1_4881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4881 
Symbol 
ID5673221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5854500 
End bp5856224 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content69% 
IMG OID641243736 
Producthypothetical protein 
Protein accessionYP_001509152 
Protein GI158316644 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.352263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.244213 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAAGAC GGACACGCAG CCCAGCGGCC GTCGAACAGG CTGACGGGCC GCCACGTCGC 
CGATCACCCG AATGCCGGCA CCCGCGGGGC TGTCGGCGTG GCGGGGGAGG TCTCCGGGCG
TTCCCGAAGC TTCCCGGCAA GAGGTTCGGA ACGACCAAGG GCGGCGTCCG GCCGACCCGG
CGGCCCGCCA GCACCTACGC TTGCAGAGTG AGCGTGCGCC GGATCATGGG GACCGAGACC
GAATACGGCG TGTCCGTGCC CGGTCAACCC AACACAAACC AGATGCTGGC TTCGTCGTTG
GTCGTGAACT CTTACGCCAA CAGGGCCGCC ACCGGGGGCA GTCGAGCTCG GTGGGACTTC
GAGGAGGAGT CCCCCCTGCG AGACGCCCGC GGCTTCGAGC TGCCCAGGGA GCCGGTGGTC
GATCCTGGCG CGCCCGCGAA CGAGGACGAC CTCGGGCTGG CCAACGTCAT CCTGACCAAC
GGCGCGCGCT TCTACGTCGA TCATGCGCAC CCCGAGTTCT CGACCCCCGA GGTGACGAAC
CCGCGCGACA TCGTCCTCTG GGACAAGGCG GGGGAGCGGG TGATGGCAGA GGCGGCCCGC
CGTGCGCTGA CTGTCCCCGG CACGATGCCC GTGCACCTGT ACAAGAACAA CACCGACAAC
AAGGGTGTCT CCTACGGCTG CCATGAGAAC TACCTCATGG CGCGGGCGAC CCCGTTCGGG
GAGATCGTGC GGCACCTGAC GCCGTTCTTC GTCTCCCGCC AGGTGATATG CGGCGCCGGC
CGGGTCGGCA TCGGCGTGGA CGGGCGTGAG CCCGGGTTCC AGATCAGCCA GCGCTCGGAC
TTCTTCGAGG TCGAGGTCGG ACTGGAGACC ACCCTCAAGC GCCCCATCAT CAACACCCGG
GACGAGCCCC ACGCCGACCC GGAGAAGTAC CGGCGTCTGC ACGTGATCAT CGGTGACGCC
AACATGAGCG AGATCGCCAC CTACCTCAAG GTGGGGATGA CCTCGCTCGT CCTGGCCATG
ATCGAGGAGG GCTGGCTCCA GGTCGACCTG TCGGTGGACG CGCCGGTCGC GACGCTGCGG
GCGATCTCCC ACGACCCCTC GCTGCGGTAT CTGCTCACCC TGCGCGACGG CCGGAAGATG
ACCGCCGTGC AGCTCCAGAT GGAGTACTTC GAGCAGGCCC GCAAGTTCGT CGAGGACAGG
CTCGGCACGG ACGTCGACCC GCAGACGGCC GACGTGCTCT CCCGGTGGGA GTCGGTGCTC
GGCCGCCTCG AGGTCGACCC GATGACCTGC TCGCGGGAGC TCGACTGGGT GGCCAAGCTG
TCGATCATCG AGGGCTACCG CTCCCGTGAC GCCCTGGCAT GGGACTCGCC GCGGCTGCAG
CTCGTCGACC TCCAGTACCA CGACGTGCGG CCGGAGAAGG GCCTCTACAA CCGCCTGGTG
GCCCGGGGCC GCTTCGACCT CCTCCTGAGT GAGGAAGAGG TCACCAGGGC GATGACGGAG
CCTCCGGAGG ACACCAGGGC GTACTTCCGG GGGCGCTGCC TGGAGCTCTA CCCGCAGCAG
GTGGCGGCCG CTTCCTGGGA CTCGGTCATC TTCGACATCG GGCGGGACTC GCTGCAGCGG
GTGCCGACCC TTGAGCCGCT GCGTGGCACC AAGGCGCACG TGGGCGAGCT GCTCGCCCGC
TGCCCCACCG CCGCCGACCT GGTGGACGCC CTGTCGGGTA ACTGA
 
Protein sequence
MRRRTRSPAA VEQADGPPRR RSPECRHPRG CRRGGGGLRA FPKLPGKRFG TTKGGVRPTR 
RPASTYACRV SVRRIMGTET EYGVSVPGQP NTNQMLASSL VVNSYANRAA TGGSRARWDF
EEESPLRDAR GFELPREPVV DPGAPANEDD LGLANVILTN GARFYVDHAH PEFSTPEVTN
PRDIVLWDKA GERVMAEAAR RALTVPGTMP VHLYKNNTDN KGVSYGCHEN YLMARATPFG
EIVRHLTPFF VSRQVICGAG RVGIGVDGRE PGFQISQRSD FFEVEVGLET TLKRPIINTR
DEPHADPEKY RRLHVIIGDA NMSEIATYLK VGMTSLVLAM IEEGWLQVDL SVDAPVATLR
AISHDPSLRY LLTLRDGRKM TAVQLQMEYF EQARKFVEDR LGTDVDPQTA DVLSRWESVL
GRLEVDPMTC SRELDWVAKL SIIEGYRSRD ALAWDSPRLQ LVDLQYHDVR PEKGLYNRLV
ARGRFDLLLS EEEVTRAMTE PPEDTRAYFR GRCLELYPQQ VAAASWDSVI FDIGRDSLQR
VPTLEPLRGT KAHVGELLAR CPTAADLVDA LSGN