Gene Franean1_5110 details


Gene Information

Locus tag: Franean1_5110
Symbol:
ID: 5673445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6119290
End bp: 6121767
Gene Length: 2478 bp
Protein Length: 825 aa
Translation table: 11
GC content: 74%
IMG OID: 641243961
Product: transglutaminase domain-containing protein
Protein accession: YP_001509375
Protein GI: 158316867
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
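
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Number of amino acids, assuming the CDS length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(6119290, 6121767)
protein = expected_protein_length_aa(length)
print(length, protein)  # 2478 825
```

Both results agree with the Gene Length and Protein Length fields listed above.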


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.167034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.118743
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGACCC CACGGCCGTT CTCCGCCGCG CTGGGCGGGA TGGCCTGCCT ACTCGCCAGC 
GCCGCGCTGG CCCCGCTGTT CGACGGGTTC GGCTGGTGGT TCGGCCCCGT CCTGATCGCG
ACGGCGGTCG CCGTGAGCAC CGCGGTGCTG GGCCGGCTGC TCGGCGCCCT GTTCCGGCTG
CCGGCGTCCA CCGGCATCTG CCTCAGCCTG ATCGGCCTGC TCACCACCCT CACCAGGGTG
AGCGCACGGG ACACCGCCCT GCTCGGCGTG TTCCCGACGC CGTCCACCGT GACGGCGCTG
CGCGAGCTGG CGCTGGCCGG CAAGCACGAC ATCGGCGAGC TCGCGGTACC GGTGCCGGAA
CGGCCCGGCC TGGTGGTCCT GGTCTTCGTC GCCGTGTACC TGGTCGTCAT GGCGGTCGAC
CTGATCGTGG TGGTCGTGGA CCGGCCCCCG CTGGCCGGCC TGCCGCTGCT CGGCCTGTTC
GTCGTGCCGG CCGCCGTCCT GCCGGCCGGG GTCGGCACGC TTCCCTTCGT CCTCGCCTCG
GTCGGCTTCG TCGCGCTGAT GCTGCTGGAC GGCAACCGGA TGGTGACCCG GTGGGGCCGC
CCGGTCGGTG ACCGGCCCCC CCGGGTCATC CGCAACGGCC TGGGATCACT CGGTGCCCGG
GTGGCCGTCG GGTCGCTGGT GATCGCGGCG GCGGTGCCAC TGCTGGTGCC CTCCCTGGAC
GGGCACGGAG TGATTGACAA CGGCGGCGGC GGGCGCTCGG GCGACGGCCC GAGCTCGGCG
AGCGTCGTGC AGCCGATCGT CTCGCTCTCC CAGCAGCTCC ATGACGACCG CGAGATCCCG
CTCCTGCGCG TCACCACGGA CAATCCGCAG TATCTGCGGC TCACCGCCCT GGAGAACTTC
GACGGCCAGC GCTTCACCCT GCGGGCCCTG AACGCGACGA AGGAAGCCCG GGTGAGCGAG
GGCCTGCCCG GACCGGAGCG GGGCGTCCGC ACGATCTCGA CCACCGCATC GGTGGCGGTC
TCCGGCGAGA TGGCCGAACG TTACCTGCCC GTGCCGGGCA TTCCCACCGA CTTCGACGGA
CTCGCCGGCG ACTGGCGCCT CGCCGAACCG ACGGGCACCG TCTTCTCCAC CCGCACCTCC
ACCGCCGGGC TGCGCTACAC CGTCAGCGCG GCGGTCCCCG ACCCCACCGC GCAGCAGATC
GCCGCCGCCA CCGGCCCGGT GCCCGAGTCG ATGAACGTCG TCACTCAGCT GCCGCAGGAC
GCCGACCCAC GGCTGCGGAC ACTGCTCGCC CAGATCACCA CCGGCGCGAG CACCGGCTAC
GCCCGGGTCC TCGCCATCCA GAACTTCCTG CGCGGCTCGG AGTTCACCTA CGACCTCAAC
GGCGCACCCA CCGTCCAGGA CGGCGCGCTC AGCGAGTTCC TCTTCGAGAG CCGGCGCGGG
TACTGCGAAC AGTTCGCGTC CGCAATGACC GTCCTGGTGC GGATGCTGGG GCTGCCCGCC
CGGGTGGCCA TCGGTTTCAC GCACGGCACC CGCACCGCCG ACGGCACCTG GGTGATCACC
AACAAGCAGG CACACGCCTG GCCCGAGGTC TGGTTCCCGA CCCTGGGCTG GCTCCCGTTC
GAGCCGACCC GCCGCTCGGA CGGCGCGACC CCGGCCCCGG ACTACGCCCC GTCGACGACC
GAGCCGACCA CCGGCCCGGA CCCGGCCGAA GTCCCCCAGG GGAACGGGGA TGTCGCCGTC
GAGCCGACGC CGAGCGCGGT GCCCGTCCCC GACGACCAGG GCGGGGCGGC CGAGGAGCTG
ACCGCCGAGG CCGACGACAA GGCCGCGGGC ACCCACAAGG GCACCTTCCC CCCCTCCTGG
CTGCCCTGGG TCGGGCTGAG TCTCGGGATC CTGGTCCTGC TCAGCATCCC GGCCCTGTCC
AGGGTGGCGC TGCGGCGCCG CCGGATGGGC TCGGGCGGCC CCGACCGCCA GGACGCCGAG
GCAGTCGCGC GCGTGCACGC GGCCTGGGCC GAGCTGGTCG ACGTGGCCGC CGACCTCGGC
ATCCACCTGC GGACGAGCGA CTCACCGCGC TCGGGCGCGC AGCGGCTTAT CGCCTACCTC
GAGGCCGGCC CCGAAGCCGG GTCGGCGGAG GTCGACGCCG CCCGGCAGGC CCTGATCCGG
ATGGCAATGG CCCAGGAGCG GGCCCGTTAC GCCCCCGCCG GGATGGCCGC GCCCGATCCG
GGCGTGGACG TCCTGGCCGA CCTGGCTCTG GCCCGCCGGG TGCTGTGGTC GGTCGCGCCC
CGGGGCCGCC GCGCGATGGC GACGGTGGCC CCGCCGTCGA TGATGCAGCG GGCGCGGGAA
ATCCCGGTGC GCGATGTTCT CGGGCGCATC CGGCACCGGG CCGACGACCC GCCGGACAAC
GGCGCCGACG ACGACCAGGA GGCCGGGGTC GGTGCCTCCG CCGGCGGGCG GACACACCCG
CCGCAGCCGC CCGCCTGA
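
The GC content field of this record (74%) can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted in as a string with the same space-separated 10-base blocks shown here. The function name is hypothetical, and the short example input is only the first block of the sequence, not the whole gene.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, used only as a small example.
first_block = "ATGGTGACCC"
print(gc_content_percent(first_block))
```

Passing the full 2478 bp sequence instead of the example block would give the value to compare against the GC content field.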
 
Protein sequence
MVTPRPFSAA LGGMACLLAS AALAPLFDGF GWWFGPVLIA TAVAVSTAVL GRLLGALFRL 
PASTGICLSL IGLLTTLTRV SARDTALLGV FPTPSTVTAL RELALAGKHD IGELAVPVPE
RPGLVVLVFV AVYLVVMAVD LIVVVVDRPP LAGLPLLGLF VVPAAVLPAG VGTLPFVLAS
VGFVALMLLD GNRMVTRWGR PVGDRPPRVI RNGLGSLGAR VAVGSLVIAA AVPLLVPSLD
GHGVIDNGGG GRSGDGPSSA SVVQPIVSLS QQLHDDREIP LLRVTTDNPQ YLRLTALENF
DGQRFTLRAL NATKEARVSE GLPGPERGVR TISTTASVAV SGEMAERYLP VPGIPTDFDG
LAGDWRLAEP TGTVFSTRTS TAGLRYTVSA AVPDPTAQQI AAATGPVPES MNVVTQLPQD
ADPRLRTLLA QITTGASTGY ARVLAIQNFL RGSEFTYDLN GAPTVQDGAL SEFLFESRRG
YCEQFASAMT VLVRMLGLPA RVAIGFTHGT RTADGTWVIT NKQAHAWPEV WFPTLGWLPF
EPTRRSDGAT PAPDYAPSTT EPTTGPDPAE VPQGNGDVAV EPTPSAVPVP DDQGGAAEEL
TAEADDKAAG THKGTFPPSW LPWVGLSLGI LVLLSIPALS RVALRRRRMG SGGPDRQDAE
AVARVHAAWA ELVDVAADLG IHLRTSDSPR SGAQRLIAYL EAGPEAGSAE VDAARQALIR
MAMAQERARY APAGMAAPDP GVDVLADLAL ARRVLWSVAP RGRRAMATVA PPSMMQRARE
IPVRDVLGRI RHRADDPPDN GADDDQEAGV GASAGGRTHP PQPPA