Gene Franean1_6669 details


Gene Information

Locus tag: Franean1_6669
Symbol:
ID: 5674984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8099189
End bp: 8100610
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 72%
IMG OID: 641245520
Product: hypothetical protein
Protein accession: YP_001510912
Protein GI: 158318404
COG category:
COG ID:
TIGRFAM ID:
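
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 8099189
end_bp = 8100610
gene_length_bp = 1422
protein_length_aa = 473


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the span 8099189 to 8100610 covers 1422 bp, and 1422 bp corresponds to 474 codons, of which 473 encode residues.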


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00124083
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.737923
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCAGATT CACCGCAGAC TTCCGAACAT TCCCGCGAAC GACCGCGCCG TCCTGGCCCG 
GTACGGGCTT TCGGGGACAT GCTGGCATTC TTCGCCGGTG CGCAGGCGAA ATTCCTCCCG
CCGACGGAGC GCGCCCGTTT CGTCACGGCC GGCGCGCTGA TGCTTCTGAC AGCGGCACTT
GCCACCTGTG CGGGGGCGAC GGTGGTGGCC CTCGGTTTCG GCATCGGGAC CCTCCAGGCG
CTGCCGTTCG GCATCTTCTA CGCGCTGTTC ATCTTCTTCA TCGACCGGTC CGTCCTGCTG
ACCCAGACCC CGTACCGCTA TGGCGCGGAC GGCGGTGTGG AAACCGGCCG GGCTGGATTC
TCCGTGGCGG TCCGCGTGTT CATCGCGGTG TGCGCGGCGA TAATCGTCGG GGAGACGGTA
CTGCTCCGGA TCTTCGAGTC CTCGATCGCG AGCCGGGTCG CCGAGATACA GCAGGAGGAC
GCCGGGCACC TACTGGCCGG GTGGGACGCG AACCAGGAGA GCGAGCTCGC CGCCCGCACG
GCCGATCTCG CGGCGAAACA GAAAGGCCTC GACGCCGCCG ACGATCTCGT CGAGGCGAAG
ACGGCGGAGG TGAACTGCCA GCTCACCGGT GGCCAGTCCG GCGATGGCCA GTCCGGCGAT
GGCCCGGCCT GCCTGGGCGG GGCCGGCCCG GTCTACCAGA TCAAGCTGGC CGAGCTGGCC
GCGGCCACCG CCGCCGTCAC CGACGCCACC CGGCTGCGCG ACGCCGCCCA ACGTGATCTT
GACGAGTTTC GGGCTGCGCA GAAGGCCAGG CGCTCCGACT TCGCGGCCAC GGTGCAGACC
ACGACGGGCG CGGCCGACGA CCTGCTGATG CGGGAGAAGG CGTTCTGGCG GCTGACCACC
GAGGACCGCT CCGTCCTGGT GTGGCGCCTG TTGCTGACGT TGCTCCTGCT CGGCATCGAC
CTCGCCCCGC TGCTGTTCAA GCGTGGTCTG GACCGCACCT CCTACCGGCA GCGCGAGCGC
CTCGAGCGCT GGCGGGACGA GACCTCCGTC GAGGTCGACG CGCTGCAGGT CGGGCACACC
GCCCGCGAGC GCCGCGACCT GGCCCCGGTG GTCGCGGCGC GTCTCGCCGG GCGCTGGGAG
GACTACCTGC TGCGCCGCGA CAGCGTCGAG ACCGCCGTGC GCTGGACGGC CGACACGGCG
CAGGCGCGCC TCGCCGAGGA GGAGATCAGC GCAGACCAGG AGTCGCGGCT GCGTGAGCTG
CGGCGACGGC ACGGCATCGT CGCCGTGCCG CGCGTCCCGA CCTCCGACGC GGAGCCGTCC
ACCCCGGCTG TCACCACCGG TTCGGCCGCG GCAGCCTCAG CCGCAACCGG ATCGGCGGTG
GCCACCGGCC CGGCGGTGCC GGCGCCGCCG GAACCGCCGT GA
 
Protein sequence
MPDSPQTSEH SRERPRRPGP VRAFGDMLAF FAGAQAKFLP PTERARFVTA GALMLLTAAL 
ATCAGATVVA LGFGIGTLQA LPFGIFYALF IFFIDRSVLL TQTPYRYGAD GGVETGRAGF
SVAVRVFIAV CAAIIVGETV LLRIFESSIA SRVAEIQQED AGHLLAGWDA NQESELAART
ADLAAKQKGL DAADDLVEAK TAEVNCQLTG GQSGDGQSGD GPACLGGAGP VYQIKLAELA
AATAAVTDAT RLRDAAQRDL DEFRAAQKAR RSDFAATVQT TTGAADDLLM REKAFWRLTT
EDRSVLVWRL LLTLLLLGID LAPLLFKRGL DRTSYRQRER LERWRDETSV EVDALQVGHT
ARERRDLAPV VAARLAGRWE DYLLRRDSVE TAVRWTADTA QARLAEEEIS ADQESRLREL
RRRHGIVAVP RVPTSDAEPS TPAVTTGSAA AASAATGSAV ATGPAVPAPP EPP
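
The protein sequence begins with M although the gene sequence begins with GTG, which would encode valine at an internal position. This is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below illustrates the idea on the first and last codons of the gene sequence. The `translate` function and the `START_CODONS` set are illustrative assumptions, not part of the record, and the set lists only three commonly used bacterial initiators rather than every start codon permitted by table 11.

```python
from itertools import product

BASES = "TCAG"
# Codon-to-residue assignments, in TCAG order for each codon position.
# Table 11 uses the same assignments as the standard code for elongation.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, is_cds_start=True):
    """Translate DNA codon by codon, stopping at the first stop codon.

    If is_cds_start is True and the first codon is a recognised
    initiator, it is read as methionine (M).
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGCCAGATT CACCGCAGAC TTCCGAACAT"))  # MPDSPQTSEH
# Last 12 bases of the gene sequence, which end in the TGA stop codon.
print(translate("GAACCGCCGT GA", is_cds_start=False))  # EPP
```

The two outputs match the first ten and last three residues of the protein sequence listed above.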