Gene Franean1_3804 details


Gene Information

Locus tag: Franean1_3804
Symbol:
ID: 5672168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4513652
End bp: 4516114
Gene Length: 2463 bp
Protein Length: 820 aa
Translation table: 11
GC content: 76%
IMG OID: 641242683
Product: transglutaminase domain-containing protein
Protein accession: YP_001508103
Protein GI: 158315595
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
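
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(4513652, 4516114)
print(length)                     # 2463
print(protein_length_aa(length))  # 820
```

Under these assumptions, the 2463 bp span corresponds to 821 codons, one of which is the stop codon, giving the reported 820 aa.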


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.307631
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0951056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCCCGC CGCCCGGTGC GGGTCCGCCG CCCGGGTCGG CGCCGGCGCC GCCGGCCCCG 
GATGGCCGGG CGGGCGGGCG GCCGGCGCGC CGGCGGATCC GCGCCGCGCC GGCGCCCACC
CCCGTCCGAA CGCCCGGGCC GCGATCGTTC CGACCCGGGC AGGTAGCCCT CGCCGCGGTG
CTGGTCGCGC TGGTCGCGGT CGCCTTCACC ACCTTCTTCA CCGGCGGCCC GACCGGGGGC
GCGGGGGTCG TCCTGCTCCC GGCGGTGGTG TTGGCGTCCG CGCTCGGCTG CCTCGCCGGG
GCCAGGCTGG GCGCCGGCTG GCTGGTCGGC CTGGTCGGCC TGATCGGTGC CCTGCTGTTC
GGCGTGCTCG CGCTGTTCGC CGCCAGGTTC GGGGACGGGC TCTCGGCGCT GTCCTCCGAG
TTCGGCTCGG CGGCGCGGGA CGGCTGGGCG CGCATGCTCA CCGTCGGGCT GCCCGCGCAC
CCCGGCGCGG ACCTGTTGTT CATCCCGGTG TTGGTGCTGT GGCTCGCGGC GTTTGCCGCC
GCCGTCCTCA CGGTGCGCAC CGATTCGGTG CTCGCGCCCG TGCTTCCGGC GATCGTCGGC
TACGTCGTCG CGCTGCTGCT GGTCGCGGCC CGGGGCCGGT CGCTGCTCGT CCTCACCGGC
CTGATCGCCC TCCTTGCGCT GGTGCTCGCG GTTGTCCGGG CGTCCCGGCT GGCAGCCGAA
GGGCAGCTCT CCGCGGTGGC GGTGAGGCCG GAGGCGGCGC CCGCGGCCCA GCCCGACGCC
GGGCGGGCCG ACGCGGATGG GACCAGCGGC GCCCGTGGCG GCGGAGCGGG CGGGCCGGCG
CGGCCCCGGG TGGGTGCGGG TCGGCTCGCG CTCGGCCTGC CGGTCGCGGC TGTGACCGCG
CTGCTCGGCA CGATCGGCGC GGCGTTCCTG CCGATCGCCG ACGGCACGGA CCGGTTCGAT
CCCCGCGACC ACCGGCATCC CCCGGTCGAG ATCTCCACCT CGCTCAACCC GTTGGTGCAG
GTGAAGGCCG CGCACAAGGC GACGGCGGCG CGGAACCTGT TCACGGTGCA GCTGTCCGCG
GTCGGCGGCA AGGTCGCGAC CGACCGGCTG CGCACCGTCA CGCTGGCCGA CTTCGACGGT
GCCAGCTGGC GCGAGGACGG CACCTTCGTC CGCAGCGGAA GCACGCTGCC GGACGGCGAC
GGCCTGGCGC CCACGGTCGG CAACGAGACC CGCATGGAAG TGACCGTCGA CACGGCGAGC
GGTCCGTTCC TGCCGTCACT CGGGCGGCCG GTGCGCATCT CCGGCGCCAG CCTCGAGTAC
GCGTTCCAGC CCGACGCCGG TGTGCTCGCC GTCGCCGCTC CGGCCCGGAC CGGCGACCAC
TACGTCCTCA CCGCGCGCGT GCCCGGCCCC ACCGACCAGC AGGTCCGCGG CGCCGTGCCC
GCCTCCGGCC CGGCCGCCGC GCGGTACCTC GAGCTGCCGC CGGGCATGCC GGCGGAGCTG
CAGGACCTGG CGTCGCGGGT GATGAGCGGG AAGTCGAGCC CGTACGAGAA GCTCACCGCC
CTGGAGGACT TCCTGCGGGA CCAGGCCAAC TACCCGGTGG ACCTGAACGC CCGTCCCGGC
CACTCGTACG GTGCGTTGAA GCGGTTCCTG ACCGGCTCCA AGGCCGACAA CCGTGGCTAC
GTCGAGCAGT TCGCGACAGC GTTCGCGTTG CTCGCCCGGG CCGAGGGCTT CCCGAGCCGG
GTTGCCGTCG GATACCTGCT CGACAGCCGT TCCTCGTCTG CGCCCGGCAG GTTCACGGTG
ACGTCGAAGC AGGCGTTCGC CTGGCCGGAG GTGGCCCTCG ACGGTATCGG CTGGGTCGCC
TTCGACCCGA CCGATATCAG CAAGCTCGGC GCCACGCCGC CGGCACCCAG CGACGACCAG
ACGCCCGGTG GCGAGGGAGC TGCCCCGCAG GCGCAGACAG TCCCTCCCAT CGTCAAACCG
GAACTGGACC GGGCGGCCCA GACCGGCGGC GGTGGCGCTG GCGGCGCCCG GAACACCCTG
CTGCTCGCGC TGCTGGCGGT CGTTGCCGCG GCGGCCGCCG TCCCGGTCGG GATCGTCGGC
GAGAAGGCAC GCCGCCGCCA GCGCCGCCGC GCCGGTACGG CGGCGGCGCG GATCGGCGGC
GCCTGGCGGG AGGTCCGGGA CCGGCTGGCC GAACGGGGGG TGGACCGTTC GCGCGCCCTC
ACCGCGGACG AGGTCGTCGC ACGCACCCGG GCACTGCGCG GCGATGCTGC CGGTGAGCGG
GTGGGCAGTC TCGCGCCGGT GGTGAGCAGC GCGCTGTTCG CCGCGGCGGA GCCGGGCGAA
GCCGAGGCAC GGCACGCCTG GGAGCTGGCA GCGGCCGTCA GCCAGGAACT CCACCGGTCC
GACAGCCTGT GGCGGCGGGT CGTCGCCGCG GTCGACCCGC GTCCATTGCT GCCGGGGAGA
TGA
 
Protein sequence
MPPPPGAGPP PGSAPAPPAP DGRAGGRPAR RRIRAAPAPT PVRTPGPRSF RPGQVALAAV 
LVALVAVAFT TFFTGGPTGG AGVVLLPAVV LASALGCLAG ARLGAGWLVG LVGLIGALLF
GVLALFAARF GDGLSALSSE FGSAARDGWA RMLTVGLPAH PGADLLFIPV LVLWLAAFAA
AVLTVRTDSV LAPVLPAIVG YVVALLLVAA RGRSLLVLTG LIALLALVLA VVRASRLAAE
GQLSAVAVRP EAAPAAQPDA GRADADGTSG ARGGGAGGPA RPRVGAGRLA LGLPVAAVTA
LLGTIGAAFL PIADGTDRFD PRDHRHPPVE ISTSLNPLVQ VKAAHKATAA RNLFTVQLSA
VGGKVATDRL RTVTLADFDG ASWREDGTFV RSGSTLPDGD GLAPTVGNET RMEVTVDTAS
GPFLPSLGRP VRISGASLEY AFQPDAGVLA VAAPARTGDH YVLTARVPGP TDQQVRGAVP
ASGPAAARYL ELPPGMPAEL QDLASRVMSG KSSPYEKLTA LEDFLRDQAN YPVDLNARPG
HSYGALKRFL TGSKADNRGY VEQFATAFAL LARAEGFPSR VAVGYLLDSR SSSAPGRFTV
TSKQAFAWPE VALDGIGWVA FDPTDISKLG ATPPAPSDDQ TPGGEGAAPQ AQTVPPIVKP
ELDRAAQTGG GGAGGARNTL LLALLAVVAA AAAVPVGIVG EKARRRQRRR AGTAAARIGG
AWREVRDRLA ERGVDRSRAL TADEVVARTR ALRGDAAGER VGSLAPVVSS ALFAAAEPGE
AEARHAWELA AAVSQELHRS DSLWRRVVAA VDPRPLLPGR
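
The gene sequence begins with GTG, while the protein sequence begins with M. This is expected under translation table 11, where GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline: it builds the standard codon table (which table 11 shares for sense codons), treats a small assumed subset of alternative start codons (ATG, GTG, TTG) as initiators, and translates only the first 60 nucleotides shown above. The function and constant names are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (first, second, third base each cycling T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a commonly used subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and reading an initiator codon as M."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator is read as methionine
    return "".join(residues).rstrip("*")  # drop a terminal stop, if present


# First line of the gene sequence above (60 nt).
first_60_nt = "GTGCCCCCGC CGCCCGGTGC GGGTCCGCCG CCCGGGTCGG CGCCGGCGCC GCCGGCCCCG"
print(translate_cds(first_60_nt))  # MPPPPGAGPPPGSAPAPPAP
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural way to compare against the complete 820 aa protein.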