Gene Franean1_4251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4251 
Symbol 
ID5672606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5065935 
End bp5068634 
Gene Length2700 bp 
Protein Length899 aa 
Translation table11 
GC content70% 
IMG OID641243124 
Producttetratricopeptide TPR_4 
Protein accessionYP_001508541 
Protein GI158316033 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.553547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.492439 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAGGGTT GGAATGACGC CGCGGGCGTG AGTGTTGGCT ATTTCATTTC CTACGCGGGG 
TCTGACCGGT TGTGGGCGGA GTGGGTCGCG GCTGAGCTCG AGACGGCAGG TGAGACCGTG
GTGCTGCAGG CGTGGGACGC GGTTCCCGGC GAAAACATTG TCGTCTGGAT GAGCCGCTCC
ATGGCGGCGG CTCGGCGGAC CATCGCGTTG TACTCGCCCT CCTATTTTGA ATCGAGCTGG
TGTACGGCGG AGTGCACGGT GGCGTTGAGT CGGCAGGTGC TGCTGCCGTT CAAGGTAGCG
GAATGCGACC CGCCCGCGGT GCTTGCCGCA ATCGGGCACA TCTCGCTCCA CGGGGTGGAC
GAGGCCGCCG CACGACGGAA ACTGCTGCGA GCCGCGGGCC TGGAGGAGAC CCCACGGAGG
TTCGACGGTC GATTTCCTGG CGGGTCAGCG CGCCGGGCGA GCGCGGGGAA CGATGCCGAC
GAAGCACCTG TCGTGCCGTT TCCTGGGTCC CTGCCCAAAA TGTGGAATGT GCGTTGGCGC
CGTCCCACGT GGTTCGTGGG TCGGGACGCG ATGCTTACGG GCATGTACGA CAGGTTTCGG
GCAGCGGGTG TCGACAGAGT CAGCTCCCAG GTGGTGATCG GGATCGGCGG AGTCGGGAAG
ACGCAGCTCG CGGTCGAGTA CGCCTACTGT TTTGCGGCCC GGTACTCACT GGTCTGGTGG
GTGGATGCGG CTGCGTCGGC GGCGGTGGTG GAGTCGTTCC ACGGGCTGGC CGACGCGCTC
CGGTTGCCCG ACGACCCCGA TGTTGAACGG CGGGCCAGGC GGGCACTGGC ATCGTTGCGT
GACCGGACCG GCTGGCTGGT CGTGTTCGAC AACGTCGAGG ACCGTCACCT GCTCGCCGAC
TGGTGGCCCG TCGCCGGCCG GGGAGACGTC CTGGTAACGG GCCGTAGTCG CATGTTGGGA
GAGTTCGGCG AGATACGCAC TGTCGTACCG TTCTCACCCG ACGAGGCGGC GTCGCTGCTG
CGGTGCCGCG CCGACCACCT CTCGGAGTCC GACGCCCTAC GCGTTGCAGA GGTGCTCGGC
CACCTTCCGC TCGCCATCAG CCAGGCCGCG GCCTACCTCG CCACGACCGG GGTCAGCGCC
GACGATTACC TCGAGCTGGT CGCGACAGCT GTGTCGACCG CGTTCGCCGA CAGCCCGTCC
GACTACCGGG CCGGCCTGCT GGGCTCGGTG GCCACCGCGA TGGACCGGCT CGTGCGCAAC
GACCCACCGG TTGCGCAGGC ACTGCGTCTC GCGGGGTTCC TCGCTCCAGC GCCGCTGGCA
TCACACGTAC TGGACGCCGT CACGGCGGCG GTTCTGCCTG AGCTGCCACC GGTCGTCGCG
AGGACCCGGG TACTTCGCGG CATCGACACG TCCGCGCTGG CGCAGGTCTC CAGCGGGACC
TTCGAGCTGC ACCGGCTCAC CCAGGCCGTG CTGCGCGCCC AACTCGCCGT CGTGGACCGG
GAACGGACGA TCGCGCAGGC CACGGACGTC CTGCTCGCGG CGGCGCCGGC CGACGCTGGC
GACCCCGCTA CGTGGCCCGT GTTCGCTGAA CTCGCGGCGC ATGTGCCGGT GCTGTTCCGG
TATGTGGACG GCGGAGGCCG CCCGGCGTTG CGTGAACTGG TGCTGGCGGT CGTCGACTAC
CTGACGAGGA CCGGCCAGCA CGCCGCGGCG GTCCGTCTGG CGGGCACGGC GGTCGACACG
TGGACGCGGC TTGGCGGGCT GGACAACCTT GATCGGCTCG CCGCCGCGCA CCGCCAGGGC
GAGGCGCTAC GCGGGGCAGG GCGCTTCGGC GAAGCCGAGG TCGTCGACCG CGACACCCAT
GCGCGGCGCC TGCGGGTCCT CGGCGCCCAG AACCGGGAGA CGTTGCGTTC AGCGGGCGCG
GTCGGACTCG ACCTGCGGGG TGTCGGGGAC CGGGCCGGTG CCCGTGACTG GAACACCGCG
GCGCTGGCTA CCGCGCGTGC CGTCCTGGGC GGTGATGACC CGCAGACGCT GGAGATCGCC
GGCAGTCTTG CCCTTGACCT GCACGGTCTC GGGGAGGTAG CGGCGGCACG TGAACTTGAT
GAGGAGGTCC TCGCCGGGCG ACGTGCCGTG CTGGGTGAGA CTCACTGGCA GACTCTGTCG
TCGGCCCGCA ACCTCGCCCG AGATCTGCGC GCTCTCGGCC TGCAGGAGCA GGCCCGCGAC
CTGGCGCAGT GGACCTTGGA GACCTCGCTT CGGGTACTCG GGGCGGACCA CCCCGACACA
CTGCTGGCCG CCAGCAGCCT CGCGGTGCTG CACTACGTCC TCGGCGACCT CGAAGCGGCC
CGTGATCTGC ACCAGGACTC GCACAGCAGG TCGAGTCGGG TCCTTGGCCC GGACCATCCG
CATACTCTGC GCATCGCGAA CAGTCTCGCC GTTGACCTGT TCCGGCTCGG TGACCTGCAG
GCCGCGCACG ACCTGCACCG TGACACCTTC GACCGGCTTC GCCGCGCCCT CGGTGACGAC
CACCCGGAAA CCCTGCACGT GGCCCACAAC CTTGCCCGGG ACCTGGGCGG GCTCGGCCGG
TACGACGATG CCGTCCGACT CCTCGAGGAC ACCCTCCGCC GCCGCAGATC CGTGCTCGGA
TCCGAACACC CTGAGACCCG CCGCACCGAA AGACGCCTCG CCAGAACTCG CGGCAGGTGA
 
Protein sequence
MQGWNDAAGV SVGYFISYAG SDRLWAEWVA AELETAGETV VLQAWDAVPG ENIVVWMSRS 
MAAARRTIAL YSPSYFESSW CTAECTVALS RQVLLPFKVA ECDPPAVLAA IGHISLHGVD
EAAARRKLLR AAGLEETPRR FDGRFPGGSA RRASAGNDAD EAPVVPFPGS LPKMWNVRWR
RPTWFVGRDA MLTGMYDRFR AAGVDRVSSQ VVIGIGGVGK TQLAVEYAYC FAARYSLVWW
VDAAASAAVV ESFHGLADAL RLPDDPDVER RARRALASLR DRTGWLVVFD NVEDRHLLAD
WWPVAGRGDV LVTGRSRMLG EFGEIRTVVP FSPDEAASLL RCRADHLSES DALRVAEVLG
HLPLAISQAA AYLATTGVSA DDYLELVATA VSTAFADSPS DYRAGLLGSV ATAMDRLVRN
DPPVAQALRL AGFLAPAPLA SHVLDAVTAA VLPELPPVVA RTRVLRGIDT SALAQVSSGT
FELHRLTQAV LRAQLAVVDR ERTIAQATDV LLAAAPADAG DPATWPVFAE LAAHVPVLFR
YVDGGGRPAL RELVLAVVDY LTRTGQHAAA VRLAGTAVDT WTRLGGLDNL DRLAAAHRQG
EALRGAGRFG EAEVVDRDTH ARRLRVLGAQ NRETLRSAGA VGLDLRGVGD RAGARDWNTA
ALATARAVLG GDDPQTLEIA GSLALDLHGL GEVAAARELD EEVLAGRRAV LGETHWQTLS
SARNLARDLR ALGLQEQARD LAQWTLETSL RVLGADHPDT LLAASSLAVL HYVLGDLEAA
RDLHQDSHSR SSRVLGPDHP HTLRIANSLA VDLFRLGDLQ AAHDLHRDTF DRLRRALGDD
HPETLHVAHN LARDLGGLGR YDDAVRLLED TLRRRRSVLG SEHPETRRTE RRLARTRGR