Gene Franean1_0636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0636 
Symbol 
ID5669053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp738776 
End bp741268 
Gene Length2493 bp 
Protein Length830 aa 
Translation table11 
GC content78% 
IMG OID641239563 
Producttransglutaminase domain-containing protein 
Protein accessionYP_001505001 
Protein GI158312493 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00254265 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.770826 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGCG CCGCGGGCCC GGCCCGCCGA GCCACCTTCG CCCCGAGCGA GGCCGGTCTA 
CGCCGCATCG CCGCGCTGGC CGTCGTCGCC GCCCTCGCCC TGGTGGCCGG GACGGGATTC
ACCCGCCTCT TCCCGACCCG GGACCTGTGG GTGCTGCTGC CGGTGGCGGC GGTCCTGCCC
GTCGTCCTGG TCGGCGCGCT CTCCCGGCAC GGCCGCCCGG TCTCGCCGGC GCTGACCGTG
CCCGTATGGC TGGCCGGCTT CGTAGCCTGG ACGGCTTACA CCGTCGCCGC CGGATCGGGC
GATCTCATGG CCCGCCTCGA TGTGGTCCGC ACGGGCGTCG TCGACGGGTG GGCCCGGGTC
CTCGACATGG GCGTCCCCGC CCCGGCCGAC CCCGACCTGC TGATCGTCGC GCTCGCCCCC
ACCTGGCTCG CCGCGGCGCT CGGCGCCGAG CTCGTTGTGC GGACCCGCGC GGCGCTCGCC
CCGGCACTGC CCGCCGCCGT AGCCCTTCTC GCCGCGTCCG CGCTGGCCGT CCCCGCCCCC
GGGGACAACC TGGCCCGGGC CGGTGCCCTC GCCGCCCTGA CCGCGCTGTT TCTCATGATC
AGAGCACCCC GGGCCGCGGG CCGCGGCCCG CGCCGTGAGC TCGCGCGCCG GGGCGGCGCG
ACACTGCTCG TGGTCGTGGT CGGCGTGCTC GCCGGCACGG CCGCGACGAG GGCCGGCGGC
GGGGACCCCG TCGACCCCCG CACCCACCGC TCGACACCAC CGGTGAGCCA GACCGAGCCG
AGCCCGCTCA GCGCACTCGG CGGCTGGACC GCCCATCCGG ACGAGGTGCT CTTCCACACC
GACGTCAGCG GGCCGGCCCC GGCCGAGCCG GTGACCCTCC GGCTCGCCGT CCTCGACTCC
TACGACGGCG CCCAGTGGCG GTCGACCGCC CGTTTCGTTC GGGCCGGCTC GGGGCCACCC
TCATCGCAAG CCGGGATCGC CGACGACCCG GCCGGCGATC CCGCTGCCGG CGACCGGGCC
GCTGATCCAG CAGGTGACGG GCCCGGCCCG GCACCGGTGG GCGAGATGCG GCAGGTCATC
GAGATCGCCG GACTGGGCGG GCGGATCCTC CCGTCGGACG GGCGGCTGGT GGGCGCCCCC
ACCGGCGTCC GGGTCGATCC CGGCACCGGG ACCCTGCTGA ACGACCGTCC GCTGCTTCCC
GGCGACCGTT ACGAGATCAC TTCAGCGCCG GACCCCCGGC CCGCGCCGGC GGATCTCCCC
AGGTTCGACG CCGGCACCGC CCGAACGGGC GGGCCCGATC CCGACCTCGA GGTCCCCGCC
GACCCGCCGG CCATCCTCGG CCGTCTGGCG GATATCGCGA CGGCCCGGGG GAGCACCCCG
TTCCAACGGG CCGCGCTGCT GCGGCAGTAC CTGAGCGCGA CCTTCACCTT CGACCCGGCC
GTCCCACCCG GGCACTCGTA CGGGCACATC GACCACTTCC TCGCCCACAC CCACCGCGGG
ACGTCCGAGC AGTTCGCGAC CGCCTTCGTC CTCGCGGCCC GCATCCTTGG GCTGCCCGCC
CGGCTTGCCG TGGGATTCAC CGCGCTCCCC GCGGCGGACG GCCAGCCCCG CACCGTGCAC
GGCGCCGACG CGCTCGCCTG GGCCGAGGTG CGCTTCGACG GCGCCGGCTG GCTGCCGTTC
TTCCCCACCC CGCCGGCCGC GGACGCCCGC GGCGCGAGCG TGGCGGGATC GAATCCGGGC
GAGACCCCCG AGCAGGCGGA ACTGATCGAC GTCGCCCTGC GCTCCGCGGT CGAGACCACG
GCACCGGCCA CCCGGGCCAC GGAGGACTCC ACCCCAGCGG CCACGCCGCC CGGCCCGGGT
GCCTGGTCAC GTTGGGCCAT CCGGAGCGGC CTGGCGATCG CCGGCGCCGC GGCGCTGTAT
CTCGCGCTCG CCGTCGCGCT GCCCTCGCTC CGGCGGGGCC GCCTGCGCCG CGCGGGCCAC
CCACGACGCA GGGTGGTCGA CGCCTGGCGG CAGGCCGTCG ACGCGCTCGC CGACGCGGGC
CTGCCCGTCC CGGCCGCCGC GAGCCCGGGA GAAGTCGCCC GGCTCGCCGC GGCCGAGGTG
GGCCCGTCCG GCGAGACGGC GATCCGCGAG CTGGCCGACA TCGTGACACT CGCCCTGTTC
GCCCCCGTGA CCGCGGTGGA ATGGGAGGGC ACCCCCGGCC GCCGCGCCGC GGACGAGGCC
TGCCGCCTCC TCGACCGGTT CGAGCGGGCC CTGCGCGAGC GCACGACACG CCGAGCGCGG
ATCCGCCGGA CGCTGGCCCC GAGCACCGTG GCCACCGAGC TCCGCCGGCT GCGGGATGCA
CGGGACCCGC GGGATCCGGG AGATCCGCCG ACCGGGGCCG CGGGGGACCC GGACGGCCCG
TGCGGCCCCG GCACCGACCA CGGGCGGCCC GATCCGGGCA CCCCGCCGGC CGGGTATGTC
TCGGACGGGA AGCGCTTTGC CCCGGCGGCG TGA
 
Protein sequence
MTGAAGPARR ATFAPSEAGL RRIAALAVVA ALALVAGTGF TRLFPTRDLW VLLPVAAVLP 
VVLVGALSRH GRPVSPALTV PVWLAGFVAW TAYTVAAGSG DLMARLDVVR TGVVDGWARV
LDMGVPAPAD PDLLIVALAP TWLAAALGAE LVVRTRAALA PALPAAVALL AASALAVPAP
GDNLARAGAL AALTALFLMI RAPRAAGRGP RRELARRGGA TLLVVVVGVL AGTAATRAGG
GDPVDPRTHR STPPVSQTEP SPLSALGGWT AHPDEVLFHT DVSGPAPAEP VTLRLAVLDS
YDGAQWRSTA RFVRAGSGPP SSQAGIADDP AGDPAAGDRA ADPAGDGPGP APVGEMRQVI
EIAGLGGRIL PSDGRLVGAP TGVRVDPGTG TLLNDRPLLP GDRYEITSAP DPRPAPADLP
RFDAGTARTG GPDPDLEVPA DPPAILGRLA DIATARGSTP FQRAALLRQY LSATFTFDPA
VPPGHSYGHI DHFLAHTHRG TSEQFATAFV LAARILGLPA RLAVGFTALP AADGQPRTVH
GADALAWAEV RFDGAGWLPF FPTPPAADAR GASVAGSNPG ETPEQAELID VALRSAVETT
APATRATEDS TPAATPPGPG AWSRWAIRSG LAIAGAAALY LALAVALPSL RRGRLRRAGH
PRRRVVDAWR QAVDALADAG LPVPAAASPG EVARLAAAEV GPSGETAIRE LADIVTLALF
APVTAVEWEG TPGRRAADEA CRLLDRFERA LRERTTRRAR IRRTLAPSTV ATELRRLRDA
RDPRDPGDPP TGAAGDPDGP CGPGTDHGRP DPGTPPAGYV SDGKRFAPAA