Gene Franean1_3164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3164 
Symbol 
ID5671541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3728070 
End bp3729293 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content70% 
IMG OID641242059 
Producthypothetical protein 
Protein accessionYP_001507479 
Protein GI158314971 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.932938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCCGGC CGGTGGTGCG GACCGAGCGG CGGCAGGTGT TCGACCTGCC CGAGATGCGT 
CCGCATGTGG TCGAACACGA GTTGGTCGAG CGGGAGTGCG GCTGCGGGAG GCGGACGCGG
GCTGCCGCGC CGGCGGGGGT GGATGCCCCG GTCCAGTACG GGCCGCGGGT CACCGCGGCG
GCGGTCTACC TGTACGCGGG CCAGTTCCTG TCCAAGGACC GGACCGCGAC CGCGCTGGCC
GAGCTCGTCG GGATCCCGCT GTCCGCCGGC ACGGTCGCGG TGATGACGCG CCGGGCTGCC
GCCGGCCTGG ACGGTTTCCT CACCACTGTC CGCGGTCTGC TCGCGGGCAG CGAGGTCCTC
GGGGCCGACG AGACCGGGCT GCGGGTCGCC GGGAAGCTGC ACTGGGTCCA CTGCGCCCGC
ACCGACAAGT ACACCCTGAT CGACTGCCAT CCGAACCGCG GGAGGGCCGG GATCGACACG
CTGGGAGTGC TTCCCGGCTT CGGTGGGGTC GTCGTCCATA ACGCCCGGGC GCCCTATGAC
AGCTACACCG ACGCGACCCA CCAGCTGTGT GTCGCTCACG TGCTACGCGA ACTACAGGCC
GTCGTCGAAG GCGCCCAGGC CGGGCAGTGG TGCTGGGCCG CCCAGGCCAC CGACGCGCTC
GTCGCCCTCC ACACGCAGAC CACCGAAGCA GCCGCCGCAG GCGCGGCCGG CCCAGATCTG
GCCGAGCTGG CCGCTCAGAC CCGGCTGCTG CGCCACGCCG CCCATATCGG GATCAGCCAG
ACCGCCGACC GGGACACGAA ACTCATGGCG GCCCGCCACG CGCTGGCCTG CCGCCTCGTC
GACCGCGAAG CCGACTACCT ACGCTTCACC CGGGACCTGC GGATACCGGC GGACAGCAAC
GGCTGCGAGC GCGACATCCG CATGATCAAA CTACGGCAGA AAGTATCCGG GTGCCTACGC
ACCCTGACCG GCGCCCGCCA GTTCCGCGCG ATCCGAAGCT ACCTGTCCAC CGTCACCAAA
CACGACCTCG GTTCTGTTCC ACGCCCTCGT CCAGCTGGCC GAAGGCCGCC CCTGGACGCC
CGCAACAGCC TGACCCCAAA CCAAAGATCA AAAAAGTACC TGACCAGTTA CATCGCCTCG
ACCCGAGAGC GTGAGACCGC TACTGCACCC ACCCGATCGG CGGGACGGGT TACCGACTAC
CAGCTAAGGT CAACGTCCAC TTAA
 
Protein sequence
MGRPVVRTER RQVFDLPEMR PHVVEHELVE RECGCGRRTR AAAPAGVDAP VQYGPRVTAA 
AVYLYAGQFL SKDRTATALA ELVGIPLSAG TVAVMTRRAA AGLDGFLTTV RGLLAGSEVL
GADETGLRVA GKLHWVHCAR TDKYTLIDCH PNRGRAGIDT LGVLPGFGGV VVHNARAPYD
SYTDATHQLC VAHVLRELQA VVEGAQAGQW CWAAQATDAL VALHTQTTEA AAAGAAGPDL
AELAAQTRLL RHAAHIGISQ TADRDTKLMA ARHALACRLV DREADYLRFT RDLRIPADSN
GCERDIRMIK LRQKVSGCLR TLTGARQFRA IRSYLSTVTK HDLGSVPRPR PAGRRPPLDA
RNSLTPNQRS KKYLTSYIAS TRERETATAP TRSAGRVTDY QLRSTST