Gene Franean1_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1603 
Symbol 
ID5670006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1918421 
End bp1919995 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content75% 
IMG OID641240522 
Producthypothetical protein 
Protein accessionYP_001505948 
Protein GI158313440 
COG category[R] General function prediction only 
COG ID[COG4908] Uncharacterized protein containing a NRPS condensation (elongation) domain 
TIGRFAM ID[TIGR02946] acyltransferase, WS/DGAT/MGAT 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCTGC TCGACGCGAT CACGAACATG GGTCACACAC CGGCGCACGT CCGCCCGCTG 
ACCCCGGGCG ACCGCGCCTA TCTCGCCTTC GTCCGGCGCA ATCCCGGTGA GCACCAGGAC
ATCGGCGCGC TGCTGCACTT CGACGGCCCC CCGCTGGACC TGCCGGCCCT GCGGGCCCAC
GTGGCCGAGC GGCTGCGCGA CCCGCGGGCC CGGATGCTGA CCGACCGTCT CGACACAGTG
CGGGTCTGGT CCCCCGAGCG GGGCGCGTCG GCCGAGGAGA CCCGCTGGGT CTCCGACCCC
GACATGAACC TCGACGACCA TGTCGTCGCC TTCGACCTGC CCCCCGCAGA CGGGGGCGCC
GCGGCGGGGC AGTCCGCCGA CGCGTCCCAC GACGCCCGGC TGCGGGCCGC CGTCGACGCG
ATCGTCGCCC GGCCGATCGA CCTCACCCGG CCGCCGTGGA TGCTCTACCT CCTGCGTGAC
CCGACCCCGG GGGCGACTGG TACCGCGCTG GTCTACCGCT CCAGCCACGT CCAGCAGGAC
GGCTTCGCGC TCTACCGGGT GATGTACCTG CTGTTCGGTG AGAGCGACGA GGTCGATCTC
GGGCTGGCGC CGACGATCCG CCGCCCCCGC CCGGCCGACT ACGCCCGGTT CGTCGGCCGC
GGGATCTCCT GCCTGCTGCC GACCCGGCGC CTCGAGTCCT GGGGCGGCCC ACCGAGCGGC
CCGGCGAGAC TTACCTGGGT GACCACGGAG CTCGCCACGC TGCGCGCGGT GGCCCGCCAG
CACGGCGTCA CCGTGAACGA CGTCTACCTG GCGGCGCTCG CCGGGGCGCT GCGCGCCTGG
TCGCTGCCGG AGTGGGAGCG CAGCGGCCGT CAGCTGCACG CGCTGATGCC CGTCAGCATC
CGCTCCGCGG CCGAACAGGA CGTCCTGTCG AACCACAGCA CCGGGGCGCG CGTCCCGCTG
TTCTGCGGTG AGCCCGACCC GGCCCGGCGG GTGGCCATGA TCGCGGCGGA GACCCGCCGG
ATGAAGCAGG GCGGGCTGGG CCTGGTGGAG CGCCAGCACT TCCCGCTCAT GGCGGCGAAG
GCCTCCCAAC GCATGCTCGC GAACGTCGGC AGCTATCCGG CCCAGATCAA CAAGATGGCG
CTGGTGGCGA CCAACGCCCG GTCGATCCGC GGCCCGCTCT CCATCGCCGG CCGCCGGATG
ACCGGGCTGA TCGGCATGGG CCCGCTGCTC GTCGGACGTC AGCACCTGGC CGTGGCGATG
TTCGGCGTGG ACGACCGGGT CGGGGTCACG TTCGTGGCCA GCGAGAGCGT CCCGGACCAC
GCCCGGCTTG CCGACCTGTG GCTGGCCGAG CTGGCCGCGC TGGGCCGGTC CGACTCGCCC
GTCGGGGTGA GCGTGCCGAC CCAGCGCCTG TCCTCGGCAT CGGCCGCGGT GACGGCCGTC
ATGGCCGGCG CGGGGCCCGG CGCTGTCTCG GGTGCCGTCT CAGGTGCGGT GACCGGTGCC
GTCCGGACCG AGGTCGGCGC CCTCATCCGC CCCTGGCGCC GCCGCCCCAC CACCCCGGCA
GCCCCCGCCA TGTAA
 
Protein sequence
MGLLDAITNM GHTPAHVRPL TPGDRAYLAF VRRNPGEHQD IGALLHFDGP PLDLPALRAH 
VAERLRDPRA RMLTDRLDTV RVWSPERGAS AEETRWVSDP DMNLDDHVVA FDLPPADGGA
AAGQSADASH DARLRAAVDA IVARPIDLTR PPWMLYLLRD PTPGATGTAL VYRSSHVQQD
GFALYRVMYL LFGESDEVDL GLAPTIRRPR PADYARFVGR GISCLLPTRR LESWGGPPSG
PARLTWVTTE LATLRAVARQ HGVTVNDVYL AALAGALRAW SLPEWERSGR QLHALMPVSI
RSAAEQDVLS NHSTGARVPL FCGEPDPARR VAMIAAETRR MKQGGLGLVE RQHFPLMAAK
ASQRMLANVG SYPAQINKMA LVATNARSIR GPLSIAGRRM TGLIGMGPLL VGRQHLAVAM
FGVDDRVGVT FVASESVPDH ARLADLWLAE LAALGRSDSP VGVSVPTQRL SSASAAVTAV
MAGAGPGAVS GAVSGAVTGA VRTEVGALIR PWRRRPTTPA APAM