Gene Franean1_0420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0420 
Symbol 
ID5668843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp494865 
End bp495920 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content64% 
IMG OID641239352 
ProductRNA-directed DNA polymerase 
Protein accessionYP_001504791 
Protein GI158312283 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACAG GGCGAAGGGG AACAGGTGGC CGGATACTCA ACGGACGGAA GGCATGCGTA 
ATGCAGAGCG CCGAAATGGT CCTTGGTGTC CTCCGTGAAC GTGGAAGGAG AGGACTGCCG
CTGGAGCGGG TGTATCGACA GTTGTTCAAC GCGGCGCTGT ACCTGGTGGC CTACGGGCGT
CTGTACTCCA ACAAGGGTGC GATGACGCCC GGGGAGACCG TGGACGGCAT GTCGCTGGCT
ACCATCGACC GCATCATCGA TGCGATGCGC CACGAGCGCT ACCGATGGAA ACCGGTGAAG
CGGGTGCACA TCCCGAAGAA GAACGGGAAG AAACGCCCGC TGGGCCTGCC GACCTGGTCG
GACAAGCTGG TCGCCGAGGT GGTGCGCCTG CTGTTGGAGG CGTACTACGA GCCGACCTTC
TCCGACCACT CCCACGGGTT CCGCCCAGGC AGAGCCTGCC ACACCGCACT CGGTGAGGTG
GTCGATGTCT GGAAGGGGAC GCACTGGTTC ATTGAGGGCG ACATCGCCCG CTGTTTCGAG
GAGCTCGACC ATCAGGTCAT GCTCGACACG GTGGGCGAGA GAATCCACGA CAACCGGTTC
CTGGGGCTCC TGAAGGCCAT GCTGCGCGCG GGGTATCTGG AGGACTGGAA ATGGGGAGCG
ACACTGTCCG GAACGGTACA GGGCGGTCCG GCGTCCCCGA TCCTTTCCAA TATATATCTC
GACCGGCTGG ACAGCTTCGT CGTGACACAC CTGCTCCCGG ACTACAACCG GGGCGAACGC
AGGGCATCCA ACCCTGCCTA CCAGAAAATC GAATATGCGA TCGCGCGTGC CCGACGGCAC
GGCGACCGGC CAGCATTACG CCGGCTTCGC CAGCAACGCC GCCAGCTGCC CAGCCAGGAT
CCCCACGATC CCAGCTATCG GCGGCTACGG TACGTAAGGT ACGCCGACTT ATGCCGACGT
CGGCATAAGT CCGCTTATGC CGAGGGCCGG GTTATGCCGA CCGGACTGCT GGAGGGTCTG
CGTGTCAGGC AGGTCCTCGT CGTGGGGGCG GCATAA
 
Protein sequence
MSTGRRGTGG RILNGRKACV MQSAEMVLGV LRERGRRGLP LERVYRQLFN AALYLVAYGR 
LYSNKGAMTP GETVDGMSLA TIDRIIDAMR HERYRWKPVK RVHIPKKNGK KRPLGLPTWS
DKLVAEVVRL LLEAYYEPTF SDHSHGFRPG RACHTALGEV VDVWKGTHWF IEGDIARCFE
ELDHQVMLDT VGERIHDNRF LGLLKAMLRA GYLEDWKWGA TLSGTVQGGP ASPILSNIYL
DRLDSFVVTH LLPDYNRGER RASNPAYQKI EYAIARARRH GDRPALRRLR QQRRQLPSQD
PHDPSYRRLR YVRYADLCRR RHKSAYAEGR VMPTGLLEGL RVRQVLVVGA A