Gene Francci3_1107 details


Gene Information

Locus tag: Francci3_1107
Symbol:
ID: 3905778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1321240
End bp: 1322889
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 71%
IMG OID: 637878439
Product: transposase
Protein accession: YP_480216
Protein GI: 86739816
COG category:
COG ID:
TIGRFAM ID:
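
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of this record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(1321240, 1322889)
print(gene_len, protein_len)  # 1650 549
```

Both values agree with the Gene Length and Protein Length fields of this record.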


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.373309
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.299253
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGAGCGGA TCAACGAGGT GGTGCTCGCG GAGAAGTTCG CGGTGCTGTT GCCGCATCTG 
GATGAGCGGC AGCGCCGGCT GGTGCTGGGT GCGGACGCGC GGGCGTTCGG GCATGGCGGG
ATCCGTCTGG TGGCCCGGGC GGCCGGGGTA TCGGTGGACA CGGTCTCGCG TGGTGTCGCC
GAGCTGGGGG CGGGCGCAGC CTCGACGGGC CGGGTGCGCG CGCCGGGTGG GGGTCGTAAG
GCGTTGCGGG AGAAGGATCC GGAGCTGGTG GCGGCGCTGC TGGCGCTGGT CGAGCCCGAC
CAGCGGGGGG CTCCGGAGTC GCCGTTGCGG TGGACGGTGA AGTCCACCCG CCGGCTCGCC
GAGCAGCTCA CCGCACGTGG GCATCCGGTC GGCGCGGATA CGGTCGGTGG GCTGCTGCGG
GCGGAGGGGT TCAGCCTGCA GGGCACCTCA CGCACGACCG AGGGCGCACG TCACCCTGAC
CGGGACGACC AGTTTCGCTA TATCAACGAA CGGGTCAAGG AGTTCACCGC GGGCGGGCAG
CCGGTCGTCA GTGTGGACAC GAAGAAAAAG GAAGTCCTCG GTGACTACGC CGTCGCCGGA
CGGGAGTGGC ACCGTAAGGG GCAGCCGGTG CGGGTCCGCG CCCATGACTT TCCCGAGAAG
GGCGCGCAGA AGGCAGTGCC CTACGGGGTC TACGATCTGG CCGCCGACAC CGGCTGGGTG
TCGGTCGGCT GCGACGGGGA CACCGCCGCG TTCGCGGTCG CGACCCTGCG TCGCTGGTGG
GACGGGGAAG GCCGTCACCG CTACCCGACC GCGACCCGGC TGCTGATCAC CGCGGACGTC
GGCGGGGCCA ACGGCTACCG GGTACGTGCC TGGAAGAAGG AACTCGCCGA CCTCGCCCGC
ACGACCGGCC TGCAGATCAC CGTGTGCCAC TTCCCGCCGG GCACGTCGAA ATGGAACAAG
ATCGAGCACC GGTTGTTCTC CCGGATCAGT ACGAACTGGC GTGGCCGGCC GTTGACCAGC
CACGAGGTCG TGGTCAACAC GATCGGCGCG ACGACGACTC GCACCGGGCT GAGCGTCCAC
GCCGAACTCG ACCCCGGCTC CTACCCGACC GGGCTGACCG TGCCCGACGA GGTCATGGAC
GCCCTACCAC TGACCGCCCA CGACTGGCAC GGCCCGTGGA ACTACACCCT CGCCCCGGCG
CCACCCCGCG CCGTCTCGAC GCCGGCGTCC CGGTACGTCG AGACCGGCCA GCCCGACGAC
CGGGCCCCGG ACTGGCTACA CCATCCGACG ATCACCGGGA TGAACGGCGG CGAGTACGCC
ACCCTGCTCG CCTCCGTCGA GCAGTACATC CTCGACCACC CGCCCATCAG CCTGCACCCC
AAGCGCGCCC GTCACCGGGT CCTGCGACGC GGGCCCCTGT CGCTGTCCGA CCGGCTGCTG
GTCACCGTGA TCCACCACCG GTGGACCACC CAGCAGCAGG CCCTCACCCG TCTGCTGGGC
TCACCCCGCG GAGCCGTCGG CGACGCGATC CACGAGATGA CCCCAGTCCT GGACGGCCTC
GACCGGCGGA TCCAACCCGC GCCGATCACC GCGCCCACCG TCCAGGACCT CACCACCCTG
ATCCACAACA TCAAGAACGG ACCTTATTAA
 
Protein sequence
MERINEVVLA EKFAVLLPHL DERQRRLVLG ADARAFGHGG IRLVARAAGV SVDTVSRGVA 
ELGAGAASTG RVRAPGGGRK ALREKDPELV AALLALVEPD QRGAPESPLR WTVKSTRRLA
EQLTARGHPV GADTVGGLLR AEGFSLQGTS RTTEGARHPD RDDQFRYINE RVKEFTAGGQ
PVVSVDTKKK EVLGDYAVAG REWHRKGQPV RVRAHDFPEK GAQKAVPYGV YDLAADTGWV
SVGCDGDTAA FAVATLRRWW DGEGRHRYPT ATRLLITADV GGANGYRVRA WKKELADLAR
TTGLQITVCH FPPGTSKWNK IEHRLFSRIS TNWRGRPLTS HEVVVNTIGA TTTRTGLSVH
AELDPGSYPT GLTVPDEVMD ALPLTAHDWH GPWNYTLAPA PPRAVSTPAS RYVETGQPDD
RAPDWLHHPT ITGMNGGEYA TLLASVEQYI LDHPPISLHP KRARHRVLRR GPLSLSDRLL
VTVIHHRWTT QQQALTRLLG SPRGAVGDAI HEMTPVLDGL DRRIQPAPIT APTVQDLTTL
IHNIKNGPY
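
The protein sequence follows from the gene sequence under translation table 11. The gene starts with GTG, which codes for valine internally but is read as methionine when it serves as the start codon. The sketch below is a minimal illustration, not the tool used to build this record: the names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical, and the start codon set is an assumption based on the NCBI definition of table 11.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G), first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a recognized start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGAGCGGA TCAACGAGGT GGTGCTCGCG"))  # MERINEVVLA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should reproduce all 549 residues.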