Gene Francci3_1074 details


Gene Information

Locus tag: Francci3_1074
Symbol:
ID: 3906417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1279455
End bp: 1280705
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 75%
IMG OID: 637878408
Product: hypothetical protein
Protein accession: YP_480185
Protein GI: 86739785
COG category:
COG ID:
TIGRFAM ID:
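
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; the record does not state these conventions explicitly. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the final codon is a stop codon that is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1279455, 1280705)
print(length)                     # 1251
print(protein_length_aa(length))  # 416
```

Both results match the Gene Length and Protein Length fields of the record.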


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0537877
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGAGG ACGACGTCAT CAGCACGGGC GTGACCTGCG CGCGCCCGCA GCCCGTCGGG 
GTCTTTCCGC TGCCGGCCGG CTTCCTGCTC GTGCCGGGCG GCGAGGCCAC GGCCGACCTG
CGCCGGACAC TGGCGGCCGG GAAGGTGCCG GTGGCCTGGC CGGCGGAGCT GGCCGCGCTG
GAGCTCGCCT ACCGCGGGGA GGTCGTCGCG GCCCTCGGGC AGCTGCGCGG CGACGACCCG
GTCGACCGCT ACAACCGCTT CGTGCTGCGA CCAGCGGGCG CGGACCCCGA AGACCCGGTG
GCACTGCGCC GGGCGCTCGG CGACGAGCTG GGGATCCTGG TGGACGTAGT CCGGTTCGCC
CTGGGCGAGC TGGACGAACC TCCGCCGCCC GGCGGGGAGA CCGACGAGAT CGCGGCGATG
GTGTGGTCCG CACACGCCGC CCACGCCATG GCCGCCGGCC GGACGGGCGA GGCAGCGGGG
CTGCTGGAGC GGGCCATCGC CGCGGCGCGG GAACCCTCCC CCGGTCTCGC CGCCCAGCTC
AGGTCCACCG CGGCGGACCT GCGTCGCGGA GTCGAGGGAC CGAGCCCGGC CGTCATCGCA
GAGCTGACGG CGGCGCTCGC CGCGCTGGCC GCCACCGACC TCACGGTCGG CCGGGCCGAG
TTGCACCTGA GCCTCGGTTC GGCCTACCAG GAGCTGGCCG GGGACGATCC GGCGGGCCTG
AAGGTCGCGG TGGAGCACTA CCTCTCGGCG CTGCGCCTGG TCCGCATCGA CACCGCCCCG
GAGCTGTTCG CGGCGGCACA GGTCAACCTC GCCACGGCGT ATCTGACCAT GCCGATGGCG
CAGGCGTCCG ACCAGCTGCG GGTCGGCGTG GCTGTGCAGG GCCTGCGGAC CGCTCTGTCC
GTCTACACCC GCGAGACTCA TCCCGAGCGG TGGGCCAGCA CCCAGCTCAA CCTGGCCAAC
GCCTTGGTCT ACGCGCCGTC CGCACACCGG GAGGACAACC TGCGGGAGGC GGTCGCGCGC
TACCAGGAGG TCATCGCCGC CCGCGACCGC GACGCCGACC CCCTCGGCTA CGCCCGGGCT
CGGGCGAACC AGGGCAACGC CCTGGCCCAC CTCGGCCTCT TCGACCCGGC GCAGGCGGTG
CTGCACGAGG CACGGGCGAT CTTCGAGGAG GTCGGGGACC CCGACGCCGT CCTGGCCGTG
CGCGGGGTGC TCGACGAGAT CGCCCGCCGA CTCACCGCGA AGCCGACGTG A
 
Protein sequence
MGEDDVISTG VTCARPQPVG VFPLPAGFLL VPGGEATADL RRTLAAGKVP VAWPAELAAL 
ELAYRGEVVA ALGQLRGDDP VDRYNRFVLR PAGADPEDPV ALRRALGDEL GILVDVVRFA
LGELDEPPPP GGETDEIAAM VWSAHAAHAM AAGRTGEAAG LLERAIAAAR EPSPGLAAQL
RSTAADLRRG VEGPSPAVIA ELTAALAALA ATDLTVGRAE LHLSLGSAYQ ELAGDDPAGL
KVAVEHYLSA LRLVRIDTAP ELFAAAQVNL ATAYLTMPMA QASDQLRVGV AVQGLRTALS
VYTRETHPER WASTQLNLAN ALVYAPSAHR EDNLREAVAR YQEVIAARDR DADPLGYARA
RANQGNALAH LGLFDPAQAV LHEARAIFEE VGDPDAVLAV RGVLDEIARR LTAKPT
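
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it assumes the sequence is given on the coding strand, strips the spaces used to group bases into blocks of ten, and uses the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in their permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Amino-acid assignments for translation table 11, with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGGGTGAGG ACGACGTCAT CAGCACGGGC"))  # MGEDDVISTG
# Last twelve bases of the gene sequence above, ending in the TGA stop codon.
print(translate("AAGCCGACGT GA"))                     # KPT
```

The two printed fragments correspond to the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives the quantity reported in the GC content field.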