Gene Franean1_4418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4418 
Symbol 
ID5672770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5275694 
End bp5277013 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content70% 
IMG OID641243286 
Producttetratricopeptide TPR_4 
Protein accessionYP_001508703 
Protein GI158316195 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGCACG CTGATGGAGT CGAGGCCCTG AAGGAGCAGC TCGCCACTCG GTTCCGGCAA 
CTTCAGGCCG AGTACCGGCT GTCGGGGACG GACCTGGAGA AACGCACCAC CCACGATCGG
AAGAACGTCT CCGCCATCCG CAACAGGGGA CGGCTCCCGA CCCGTGACAT CCTCCGGGCC
TACGACCGGG AGTTCGGGAC CGGCAACGAG CTGACTGACC TGGGTGAGCG GATCCGTGCC
GCGCAGAAGG CAGTGCGGCT GACCGAGGTG ACGGCCGCAG TGGGGGTAGA CACCCCCGGG
CCGCAGCGGG CAGAGTCGGG AAGGGAGGAG GTGGAGGAAA CGGATCGACG CCAGATCGTT
GCGATCGCTG CGCTATCCGC ACTTGCGTTC GAAACGACCC GCCGTATCGA CACCTCTGCC
ACGGCACCGA CCCTCGGCGA ACTCGAAGAC GACCTGGCCG ACATCGCTGC CGGCTACGAC
ACCACCCCGC ACCAGATGCT CGTCGGTGAG GTCGCCCGAC GATGGCACCA GGTCGAAGAC
ATGCTCGACC GCCGCTTGAG CGTGACCGAC AGTCTGCGGA CTACTCGCCT CGGCGGCCAG
CTCACCTACT ACCTTGGCCG GCTCGCGTTC GCCGGCGGTC ACTACCGAGA CGCCCGCCGG
TTCTGCGATC TTTCAGACCG GTACGCCGAC CAGGTCGGCG ACGAGATGCT GACCGGCTCC
CTCGCCGCGC TGCGGTCCAG CATCGCCTAC TACACCCACC GCTGGGACAA GGCTGCCCTC
ACCGCCGCCC GGGGCCGCCG CGGCGCGGAA CCCTACCTTG TCGCCCGGCT GGCCGCTTAC
GAGGCGCGGA GCCATGCCCG CCTCGGCCGG GTCCGGGAGA CAGAGAACGC TCTCGCCGTG
ATGAGGGCCC ACGCCGGGGT GGCAACGAGG CCCCGGCCCG GCTCCTCCCC GTTCACGGCT
GGCAGCGCGG CCATGTTCGC CGCGGTATGC GCGATCGAAC TGGGTGACGG CGCCGAAGCC
CGGCGCCACG CCCGCGAAGC GGTCGATCTC ATCGACCCCC GGTCCCATGA AGAACGCGGC
CACGCCTACC TCTGCCTCGC CTCCGGCTTT CTTCTCCATG ACCGGCCCGA TCCTGCCGCA
GCGATCGCGG CGAGCCGGGC CGCCGCGGCT GTTCCCGACG GGCACCTGTC CGCCACGGTC
GTGTCCGCCA TGTCCGAGGT GGTCCGGGAG CTTGGGCCGT GGGCCTCGGA TCCTGACGTC
CGCGCGTTCG GCGCGCTGGT CCAGCAATCC CGCCTCGCGC TGCCTGGGAG CCCCGTATGA
 
Protein sequence
MVHADGVEAL KEQLATRFRQ LQAEYRLSGT DLEKRTTHDR KNVSAIRNRG RLPTRDILRA 
YDREFGTGNE LTDLGERIRA AQKAVRLTEV TAAVGVDTPG PQRAESGREE VEETDRRQIV
AIAALSALAF ETTRRIDTSA TAPTLGELED DLADIAAGYD TTPHQMLVGE VARRWHQVED
MLDRRLSVTD SLRTTRLGGQ LTYYLGRLAF AGGHYRDARR FCDLSDRYAD QVGDEMLTGS
LAALRSSIAY YTHRWDKAAL TAARGRRGAE PYLVARLAAY EARSHARLGR VRETENALAV
MRAHAGVATR PRPGSSPFTA GSAAMFAAVC AIELGDGAEA RRHAREAVDL IDPRSHEERG
HAYLCLASGF LLHDRPDPAA AIAASRAAAA VPDGHLSATV VSAMSEVVRE LGPWASDPDV
RAFGALVQQS RLALPGSPV