Gene Franean1_1117 details

Gene Information

Locus tag: Franean1_1117
Symbol:
ID: 5669530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1336144
End bp: 1337130
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 77%
IMG OID: 641240049
Product: transglutaminase domain-containing protein
Protein accession: YP_001505477
Protein GI: 158312969
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
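
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1336144
end_bp = 1337130

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (987 bp) and Protein Length (328 aa).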


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.113705
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
GTGGAGACTC CTGGGGCACC CGGTGGCCGG CTCACGCCCG CCGGGCCGGC GTCTGCCGCG 
CCCGACATCG CGGGCGCGCA GCGGGTCACC TACCGGGTGA GTCAGCGGTT CCGGTACACC
TACGACGGCA GCGCGACGAA CCTCGACCAC CGGCTTGTCG CGGTGCCCCC GCCGCGGCAC
GGCGGCCAGT TCCGCCGGTC CGTCGACCTG CGGGTCTCCG CCCCGGAGGC GCGCACGACC
TGGCAGCGCG GGCCGGACGG CCTTCAGATC GCCAACATCC GGATCGACGT CGTGCCGCCG
ACGCTGGACT TCGATGTCAC CGTCGTCGTC GAGCGGATCG CCCGGGCCGG GTGGCCGACG
CTGCCGGCGT CCGCGCTGAG CAGCCGGCGC CTGCTGACCG CCACCGCTCT CACCTCCCCC
ACCCCGGCGA TGATCGACGC GGCGCGCTCG ATGGCCGGGC CCGACCCGGT CGCCACGGCC
CGCCGGGTGT GCGGATGGGT CCACGAGCGC ATCGCCTACG TCTCGGGCAG CACCGACGTC
GGGACGACCG CCGCCCAGGC GCTCGGCGGC GGGCGCGGCG TCTGCCAGGA CCAGGCCCAC
GTGATGATCG CGATGTGCCG CGCGGCCGGC ATCCCCGCCC GGTACGCGCA GGGGCACATG
CTGGGTGAGG GCGCCTCGCA TGCCTGGGTG GAGGTGCTGG TGCCCGCGGC CCTCGCTCCG
CCCGTCGACG GGGCCTCGCC CGTCGGCGGG GCTGGCGCGC CCGCCGGGGC GGGGGCGGTC
GATCCGGGGC CGGTCGAGGC GTTCGCCCCC GGCGGTGTCG GCGCGGCCGC GGTGGCCTTC
GACCCCTGCC ACGACCGGCT CGCCGACCTG CGGTACGTCA CGGTGGCGGT CGGTCGCGAC
TACCAGGACG TCGCACCGAC CTCGGGTCGC TACGTCGGCG CGGGCCGCGG CGTCCTGGTC
GCCACCGCGC GGGTCGACGT CGTCTAA
 
Protein sequence
METPGAPGGR LTPAGPASAA PDIAGAQRVT YRVSQRFRYT YDGSATNLDH RLVAVPPPRH 
GGQFRRSVDL RVSAPEARTT WQRGPDGLQI ANIRIDVVPP TLDFDVTVVV ERIARAGWPT
LPASALSSRR LLTATALTSP TPAMIDAARS MAGPDPVATA RRVCGWVHER IAYVSGSTDV
GTTAAQALGG GRGVCQDQAH VMIAMCRAAG IPARYAQGHM LGEGASHAWV EVLVPAALAP
PVDGASPVGG AGAPAGAGAV DPGPVEAFAP GGVGAAAVAF DPCHDRLADL RYVTVAVGRD
YQDVAPTSGR YVGAGRGVLV ATARVDVV
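
The protein sequence above corresponds to the gene sequence translated with translation table 11, in which an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal Python sketch of that translation, not the tool used to produce this record. The function name `translate_cds` is hypothetical, and the sketch assumes the input is a forward-strand CDS beginning at its start codon.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same amino
# acid assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        # A recognized start codon at position 0 is translated as methionine.
        if index == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGAGACTC CTGGGGCACC CGGTGGCCGG"))
```

Passing the space-separated blocks exactly as they appear in the record works because whitespace is removed before the sequence is split into codons.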