Gene Franean1_1240 details


Gene Information

Locus tag: Franean1_1240
Symbol:
ID: 5669653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1488663
End bp: 1491425
Gene length: 2763 bp
Protein length: 920 aa
Translation table: 11
GC content: 77%
IMG OID: 641240172
Product: GCN5-related N-acetyltransferase
Protein accession: YP_001505600
Protein GI: 158313092
COG category: [C] Energy production and conversion
COG ID: [COG1042] Acyl-CoA synthetase (NDP forming)
TIGRFAM ID:
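
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no residue to the protein. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1488663
end_bp = 1491425
gene_length_bp = 2763
protein_length_aa = 920

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
encoded_residues = codons - 1

print(span_bp, remainder, encoded_residues)  # prints: 2763 0 920
```

Under these assumptions the listed gene length (2763 bp) and protein length (920 aa) agree with the start and end coordinates.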


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0677298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00305841
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAACGCCG GCCCCGATCC TGGCCTGCCG CGCGACTACC CCGCCCACTG GGAGGCCGAC 
GTCATCCTGT CGGACGGCGG GACGGCGCAT ATCCGTCCGA TCCGGCCGTC GGACGGGGCG
CTGCTGCGGC CGTTCTGGTC CCGGCTGTCC CAGCGGACGA TCTACTTCCG GTACTTCAAC
GTCCGGCGCG GGCTCAGTGA CGAGGACATC GCCCGGACGA CCAACGTCGA TCAGTGGGTC
CGCGGCGCCC TGGTGGCGCT GATCAGCGGC GAGATCGTCG CCCTCGCGCA CTGGGAGGGC
CGCCCCCGCT CTGCCGCCGA CGCGGCCGGT CCACCGAACG CCCCCGGCGC CCCCGGCACC
CCCGACACCA CCAGCACCCC TGACGCTGCC GGAAGCCGCG GGAGCGCCGA TGACCGTCCG
GCGCCCGACG CGGAGGTCGC CTTCCTCGTC GAGGACGCCC AGCAGGGCCG CGGGCTCGGC
TCGGTGCTGC TGGAGCACCT GGCCGCGGCC GCGGCCGAGC GCGGGGTGCG CCGCTTCGAC
GCCGACGTCC TCAGCGAGAA CCAGCAGATG ATCCGGGTGT TCCTCGACGC CGGCTACACC
GTGGCGCGCG CCTGGGAGTC CGGCGGGGTC CGGCTGTCGT TCGACATCGC GCCGACGGCC
CGCTCGGTGG ACGTCATGCG CGCCCGGGAG CACCGGGCCG AGGCCGCGTC GATGAACCGG
CTGCTGCATC CCAGGGCCAT CGCGGTCGTC GGCGCCGGCC GGGACCGCTC CTCACTGGGC
AACATCGTCC TGCGCAACCT GCTCGCCGGC GGCTTCGACG GCCCGGTGTA CCCGGTCAAC
CCGGCCGCCG CGGCGGGGGA GGGTGCCGTC GCCTCGGTCC GGGCGTACGC CTCGGTGGAG
GACACGCCGC GGCCCGTCGA CCTCGCGGTG CTCTGCGTGT CCGCGGAGGT GATCCCGGCG
GTCGTCGCCG CCTGCGGCCG GCACGGCGTG CGCGGTCTGG TCGTGGTGAC CGACCAGCGG
GACGACGCCG CCGACGCGCG GCTCGCCTCC GACGCCCGCG CGAACGGTAT GCGGGTGGTC
GGCCCGGCCA GCCTCGGCAT CCAGAACCCG GCGGTGGGGC TGAACGCCTC GCTGGTCGAG
CGGATGCCGC CGGCCGGCCG CATCGGCTGC TACTCGCAGT CCGGGCCGCT CGGCGGGGCG
CTGCTGGAGG CCGCCGCGGG CCGTCGGCTG GGGTTCTCGG TCTTCGTCTC CGCGGGCGAC
CGCGCGGACG TCAGCGGCAA TGATCTCCTG CAGTACTGGG AGGCGGACCC GTCCACGGGT
GTGGCGCTGA TGCACCTGGA GACCTTCGGG AACCCGCGCA AGTTCGCCCG GCTGGCGCGC
CGGCTCGGCC GCGACACCCC GGTCGTGGTC GTGCTCTCCG AGCGCACCCC GCTGGACGAG
GCCCTGCTGC GCCAGGCCGG GGTGATCGGC GTCGACCGGG TCTCGCAGGG CCTGGACGTG
GCGCTGCTGC TCGCCAACCA GCCGCTGCCG GGGGGCAACC GGGTCGCCGT GGTCGGCGAC
TCACGGGCCC TGGTGGGGTT CACCGCCCGG GCGGCCGACG CGGCCGGGCT CGCGGTGCGG
GAGGTGCTGC TGCCGGTCGG CAGCACCGCC GAGGCGTTCC GCGACGCTCT CGTCTCGGCC
TCGGCCGAGG TGGACGCCCT GCTCGTGATC GCGGTGCGGC TGCCGTCGTC GCTGCCCGGC
CTCGCGGCCG CGGGAGTCGG CGTGGCCGCG GGAATCGCCG CCGCCGCCGC CGCGCCGCTG
GTGCGGGTGC CCCTGCTGGC GACCGTGCGG GCCACGGAGG CGTCGCCGGA GCTCGGCGCG
ATACCGGCCT ATCCCTCGCC CGAGGGCGCG GTCGCGGCGC TGCGCCGGGC CGTCGGCTAC
GCCCACTGGC GGGCCCTGCC CTCCGGGGCG GTGCCGGCCA CCCAGGTGCG GGCCGAGGAG
GCCCGCCGGC TCGTGGCGGG GACGACCGGC CGCCTCACCG ACGGCGCGGC CGGCGAGCTC
CTGGCCTGCT ACGGCATCGA GGTGGTGCCC CGCCGGGTCA TCGGCGGCGC TGATGAGGCG
GTCGAGGCCG CCGCCCTGCT CGGCTGGCCC GTGGTGCTCA AGGCGCTGTC CGACGGGTAC
CGGCACCGGC CGGACCTCGG GGGGCAGCGC CTGGACCTGC CCGACCCGGC GGCCGTGCGC
GCCGCGTGGC GCTCGCTGGC CGAGCGGCTC GGGCCGGGCG CGCCGATCGT CGCGCAGCGG
ATGGTCCCCG GCGGGGTCGC GGTGGTCGCC GGAGCCGAGC AGCATCCCCG GTTCGGCCCG
CTGGTGTCGT TCGGGCTGGC CGGCCCGGCC ACCGAGCTGC TGGGTGACCG GGTGCACCAC
ATCCTCCCGC TGACCGACGC CGACGCCGCG CGCCTTGTCC GCTCGGTGCG CGCGGCGCCG
CTGCTGTTCG GCTACCGCGG CGCCGAGCCG GTGGACGTCG CCGCGCTCGA GGATCTGCTC
CTGCGGCTGG CCCGCCTCGT GGATGACATC GGCGGGGTGA AGCACCTCAC ACTCGAGCCC
GTGATCGTCT CGGTGGACCG GGTAAGCGTG CTGTCCGCGG ACATCGTCCT GGCACCGCCA
ACCCCTCGCG CGGACGCCGG TCCGCGCCGG TTCTGGCGCC CCGTCGCGGA CCTGCCGGCC
AGGCCGGAGC CCCGGGCCGG TTCGGCACAA CGTGTCCCCA CCGTCCACAA TCGTCTGCCA
TGA
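
The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be removed before the sequence can be measured. The following is a minimal sketch of how the GC content field could be recomputed from such a block. The function name `gc_fraction` is hypothetical, and the sample is only the first line of the sequence above, so its result is not expected to match the whole-gene figure.

```python
def gc_fraction(grouped_sequence: str) -> float:
    """Return the G+C fraction of a sequence printed in whitespace-separated blocks."""
    # Remove spaces and line breaks, then normalise case.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, used here only as a short sample.
sample = "GTGAACGCCG GCCCCGATCC TGGCCTGCCG CGCGACTACC CCGCCCACTG GGAGGCCGAC"
print(f"GC fraction of sample: {gc_fraction(sample):.2f}")
```

Passing the full gene sequence block to the same function gives a value that can be compared with the GC content listed in the Gene Information section.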
 
Protein sequence
MNAGPDPGLP RDYPAHWEAD VILSDGGTAH IRPIRPSDGA LLRPFWSRLS QRTIYFRYFN 
VRRGLSDEDI ARTTNVDQWV RGALVALISG EIVALAHWEG RPRSAADAAG PPNAPGAPGT
PDTTSTPDAA GSRGSADDRP APDAEVAFLV EDAQQGRGLG SVLLEHLAAA AAERGVRRFD
ADVLSENQQM IRVFLDAGYT VARAWESGGV RLSFDIAPTA RSVDVMRARE HRAEAASMNR
LLHPRAIAVV GAGRDRSSLG NIVLRNLLAG GFDGPVYPVN PAAAAGEGAV ASVRAYASVE
DTPRPVDLAV LCVSAEVIPA VVAACGRHGV RGLVVVTDQR DDAADARLAS DARANGMRVV
GPASLGIQNP AVGLNASLVE RMPPAGRIGC YSQSGPLGGA LLEAAAGRRL GFSVFVSAGD
RADVSGNDLL QYWEADPSTG VALMHLETFG NPRKFARLAR RLGRDTPVVV VLSERTPLDE
ALLRQAGVIG VDRVSQGLDV ALLLANQPLP GGNRVAVVGD SRALVGFTAR AADAAGLAVR
EVLLPVGSTA EAFRDALVSA SAEVDALLVI AVRLPSSLPG LAAAGVGVAA GIAAAAAAPL
VRVPLLATVR ATEASPELGA IPAYPSPEGA VAALRRAVGY AHWRALPSGA VPATQVRAEE
ARRLVAGTTG RLTDGAAGEL LACYGIEVVP RRVIGGADEA VEAAALLGWP VVLKALSDGY
RHRPDLGGQR LDLPDPAAVR AAWRSLAERL GPGAPIVAQR MVPGGVAVVA GAEQHPRFGP
LVSFGLAGPA TELLGDRVHH ILPLTDADAA RLVRSVRAAP LLFGYRGAEP VDVAALEDLL
LRLARLVDDI GGVKHLTLEP VIVSVDRVSV LSADIVLAPP TPRADAGPRR FWRPVADLPA
RPEPRAGSAQ RVPTVHNRLP