Gene Franean1_2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2454 
Symbol 
ID5670850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2917687 
End bp2919747 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content77% 
IMG OID641241371 
Producttetratricopeptide TPR_4 
Protein accessionYP_001506792 
Protein GI158314284 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.928825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGACCG GCGGCCGACC GGACCCACGT GAGGGTGTCC GTCCCAGGCT CCCCGCGACC 
CGCGCCACGG GCGAGTGGCA GCCGGCCGAA TGGAACGTCC CGCGCCGCGA CCCGTTCTTC
GTGGGCCGCG ACGCGCTGCT CCAGCGCATC CACCGCGAAC TGCGCCCACC AGGAGTCGGC
CGGGGACCCG ACTTCGGCCG CGGGCGTGAC GCCAGCCGGG GGCGCAATGC AGGTCAGGGG
CGTGACATCG GCCGGGGGCG CAACATCGGG CTGGTGTCCG CGCGTGGCGC CGCCGGGATG
GGCGCGTCGC GGCTCGCCGT CGAGTACGCG CATCGGCATG CCGGAGACTA CGGCCTGGTG
TGGTGGGTGG ACGCGGACAC CCCCGAGCGC GCCGAGTCGT GCCTGGCCGA GCTGGCCGCC
GCCATCGCCC CGCCGGGGGC AGGCCACCGG GCCGCCGTGC TGCGCCGCCT GTGGGCCGAG
CTCGGCCAGC GCACGGACTG GCTGCTGATC TACGACGGTG TCGGTGATCC CCGCGACCTG
ACCGCCGTGG CACCGCCGGA CAGCGGCCGG CTGCTCGTGA CGAGCCGCGG ACCAGCGGTG
GCCAGGCTCA CCCCCGTGCT GCTCGAGGTC GGCGAGCTGC GCCGGGACGA GTCGGTCCTG
CTACTACGCC AGCACCACCG CGGAATCAGC CCGCCGGCCG CGGAGGCGCT TGCTTGTGTG
CTGGCGGACG TCCCCTTCGC GGTCGCGCTG GCCGGCCGCC ACCTGGCGAC GACCGGCCAG
CCGGTCGCCG ACTACCTGGC CCTGCTCGGG CGGGAGGCCG CTTCCGATCC GGTCGCCGCG
GCCGTCGCAG CGAGCCGCGC GCGACTGGAC GTGGTCGATC CGGCCGCGGC CGGCCTCCTC
GACCAGGTGG CGTTCCTCGC AGCGGACCCC CTGACGCTCA CCCGGGCGCC AGCCGGCACC
GCCGCCGCGG CCGCCCGGCT CGACCGCCTC GGCCTGGCCG ACTGGGACGG GGCCACGATC
CGCGTGCACG GGCACGTCCA GGCTCTGGTG CGCCGGCGGC TGGCCGGAGG CCGGCGGCCA
GCGGCCCTGC TCGGCGCGCA GCGGCTGCTG GTCTGTGCGT ACCTGCCCGG TCGCGACCCC
GCGGACCCGG CCTCATGGCC GTCACTCGCC ACGCTCGACC CGCACATCCG CACACTCACC
ACGTGGCTCG ACGACGAGTG CGCGGACTTC CGCCGGCTGG TGCTGCGCAC CGCCCGGTAC
CTGGCCGCCT CCGCCCGCTA CGAAGCCGGC GAACGGCTCA CGCACCAGGC ACGGGCCCTC
TGGGGCGGAC GGCTGGGCCC GGACCACCCG GACACGCTTG CCGCCGCCGA CCTCGAGGCC
TCGATCCGCG CCGACGGGTT CGCCGATCAC GACACCGCGC GGGCACTGCT GGGCGACGTC
CACGCCCGCC GGGTGCGGGT GCTCGGCGAG CAGCACCGCG ACACGCTGCA CTCGGCGTGG
AACCTGGCGC GCGCGGCCGG GGAGAGCGGC GACCACGACG AGTCCGCCCG CTTGCTGCGA
GACACCGTCG CCCGGCAGCG GGCGGAGCTC GGCCCGGACG ACCGCGACAC CCTGCGTTCG
GCGCACAGCC TCGGCCACGC CCTGACCCGG CTGGGGGAGT ACGGCGTGGC CCACGAGCTG
CTCGCGGAGA CGCTCGCCCG CCGCGGCCGG ACGTCCGGCG CCGCCCACCC GGACACCGTG
TGGACCGCGG CGGTGCTCGG CGTCGCGCTG CGTGGCCTCG GCCGTGGCGC GCACGCCCGC
GACCTGCACG CCCGCGCGCT GGCCGGCGCG CGGGCACAGC TCGGCGACGA GCATCCGCTG
ACCCTGTACG CGGCGCTGCA CCTCGCACTC GACCTGGCCG CCCTCGGCGC GGCCGACGCC
GGCCGGGACC TGTTCACCGA AGTCATCGGC TGGGACGTCG TCGGATGGCG CGACAACTCT
CCGTCGGGCC CACGAACCTG GGTCGCGCAG TGGGCCCGGG GCCACCAGGA GGACATCCGC
AGACTGTGGT CGGGTAGCTG A
 
Protein sequence
MLTGGRPDPR EGVRPRLPAT RATGEWQPAE WNVPRRDPFF VGRDALLQRI HRELRPPGVG 
RGPDFGRGRD ASRGRNAGQG RDIGRGRNIG LVSARGAAGM GASRLAVEYA HRHAGDYGLV
WWVDADTPER AESCLAELAA AIAPPGAGHR AAVLRRLWAE LGQRTDWLLI YDGVGDPRDL
TAVAPPDSGR LLVTSRGPAV ARLTPVLLEV GELRRDESVL LLRQHHRGIS PPAAEALACV
LADVPFAVAL AGRHLATTGQ PVADYLALLG REAASDPVAA AVAASRARLD VVDPAAAGLL
DQVAFLAADP LTLTRAPAGT AAAAARLDRL GLADWDGATI RVHGHVQALV RRRLAGGRRP
AALLGAQRLL VCAYLPGRDP ADPASWPSLA TLDPHIRTLT TWLDDECADF RRLVLRTARY
LAASARYEAG ERLTHQARAL WGGRLGPDHP DTLAAADLEA SIRADGFADH DTARALLGDV
HARRVRVLGE QHRDTLHSAW NLARAAGESG DHDESARLLR DTVARQRAEL GPDDRDTLRS
AHSLGHALTR LGEYGVAHEL LAETLARRGR TSGAAHPDTV WTAAVLGVAL RGLGRGAHAR
DLHARALAGA RAQLGDEHPL TLYAALHLAL DLAALGAADA GRDLFTEVIG WDVVGWRDNS
PSGPRTWVAQ WARGHQEDIR RLWSGS