Gene Franean1_1333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1333 
Symbol 
ID5669744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1605101 
End bp1606411 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content74% 
IMG OID641240264 
Productlanthionine synthetase C family protein 
Protein accessionYP_001505691 
Protein GI158313183 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.551355 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCG TCATCGACAC GCGAGTACGC GCGGCCACCG TCGCGACGCG GCTGGCCGAC 
GCGCTGACGG TGCCGCCACC ACCCGAGCCA GACGGCGACC GGAGCCCGAG CAGCCCACGC
TGGCAGGGCC AGTCGCTGGC CGAGGGAGCG GCCGGCATCG CGGTTCTCCA CGGCGTACGC
GCCCGCGCCC ACGCTGGGGA GTGGGCCACG GTCGATGCCT GGCTGACGGC CGCTGCCAGG
GAAGACCTTT CGGTCGGGCC GGGTGCGGGC CTGTGGTTCG GCGCCCCGGC GCTTGCGCTC
GCGCTGACCG CGGCAGCCCC ACCCGGCCGC CACCTCGGCG CGGCCCGGCA GCTGCACACC
GCCGTCGAAA GGCTGACCGA GCGCCGGCTC GCGGCGGCCC ACGCCCGGAT TGATGCCGGA
CAGCGACCGG AGCGCGCCGA GTTCGACCTG GTCCGAGGCC TGACCGGGCT CGGCGCCTAT
CTGGCGACCC GCAACCCCGA CGGCGAGCAG CTCCGCCAGA TCCTGACCTA CCTTGTCCGA
CTCACTGAAC CGCTACCCGC CACGGACACG GCCGGGCTGG CCGCGCCAGG CTGGTGGACC
ATCGACGTTC CCACCACCGC GCCACCCGGA CCGTTCGCCG ACGGCCATGC CGATCAGGGC
ATGGCCCACG GCATCGCGGG GCCGCTCGCA CTGCTGGCGC TCACACACCG CCGTGGGGTC
ATCGTCCCCG GCCACACCGA CGCCCTCGAC CGGATCTGTC ACTGGCTGGA CACCTGGCGC
CAGGACGGCC CCGCCGGGCC CTGGTGGCCC GAACGGATCA CCGCCAGTGA GTTGCTGACA
GGCCGGGCCG CCCAGCCCGG CCCAGGCCGC GCATCCTGGT GTTACGGCAC TCCCGGCCTG
GCCCGCGCCC AGCAGCTCGC CGCGGTCGCG CTGGCCGACA CCACCCGGCA ACAACGCGCC
GAGGCAGCCC TCGCAGCCTG CGTCACGGAC CCCGCCCAGC TCGCCCGGTT CGTCGACCCG
GCGCTCTGCC ACGGCTGGGC GGGCCTGGTC GCCACCGTTC GCTGCGCAGC CGCGGATGCC
CGCTTCTACC CGCTCGACAG CCACCTACCC AGCTTGGTCA AGCAGCTCCT CGACAGCCTC
GACGCGGCGC AAGGCGCCGA CTGGCAGCTA CCCGGCCTCA TTGAGGGCAC GGCAGGAATC
GCCGCGGTCC TGCATGCCGT GGCGACCAAC ACCACCACAG CCTGGGAGTC CGCCCTCCTG
CTCGACCTCC CCCCGGCTTG GCCGGCAGGG GACCGGGGAG CCGAAGCATG A
 
Protein sequence
MTVVIDTRVR AATVATRLAD ALTVPPPPEP DGDRSPSSPR WQGQSLAEGA AGIAVLHGVR 
ARAHAGEWAT VDAWLTAAAR EDLSVGPGAG LWFGAPALAL ALTAAAPPGR HLGAARQLHT
AVERLTERRL AAAHARIDAG QRPERAEFDL VRGLTGLGAY LATRNPDGEQ LRQILTYLVR
LTEPLPATDT AGLAAPGWWT IDVPTTAPPG PFADGHADQG MAHGIAGPLA LLALTHRRGV
IVPGHTDALD RICHWLDTWR QDGPAGPWWP ERITASELLT GRAAQPGPGR ASWCYGTPGL
ARAQQLAAVA LADTTRQQRA EAALAACVTD PAQLARFVDP ALCHGWAGLV ATVRCAAADA
RFYPLDSHLP SLVKQLLDSL DAAQGADWQL PGLIEGTAGI AAVLHAVATN TTTAWESALL
LDLPPAWPAG DRGAEA