Gene Franean1_2800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2800 
Symbol 
ID5671189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3314242 
End bp3315441 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content72% 
IMG OID641241709 
Productlanthionine synthetase C family protein 
Protein accessionYP_001507129 
Protein GI158314621 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACAC TGTCCCGCCT CCCGACGGTC CCAGACCTCG GCGACGCCGA ATCGGCGCGC 
TGGGCTCAGT CCCTCGGCGA CGGAGCACCA GGGATCGCAC TGGCGCATAT CGCTCGGGCC
CGTGCCGGCC TCGACGGCTG GGAACCCGTG CACCGTCTCG CGGCGGCGAT GACACGTAGT
CCGCTCAACG CCCATCCGGA CACCGCCAGC CTGTACCAGG GCGCACCGGC CGTCGCCTAC
GCCCTGCATA TCGCCGGCCA CCGGGCCTAC GACGCCGCGC TCGCCACCCT CGACGAGGCC
ATCGCCACCG TCATCCGGCG CCGTCTGGAG GCCGCCCACC GCCGCATCGA TCACGAACAG
CTGCCGCACG CCGGCGAGTA CGACCTGATC AACGGGCTCA CCGGCCTCGG CGCACTCCTC
CTACACCACG ACCGCGAAAG CGCTCTTCTC CGGGACGTAC TCGCGTACCT GGTGCGGCTG
ACCCGACCCA TCCGTGTCGA TGGCCGCGAC CTGCCCGGCT GGTGGGCAAC GGGCAGCCCC
GACCGCCGCG CCTCCGCCCG ATGGAACGCC GGCCATGCCG GCTTCGGCAT GGCCCACGGC
ATCGCCGGGC CGCTGGCGCT CCTGGCCATC ACCATGCGGC GGGGGATCGC CGTGGCGGGA
CACGTCGACG CGCTCCACAA CATCATCGCG TGGCTCGACC AGTGGCGCAG AGGGCAGAGG
CGGACTGGCT GGTGGCCCGA GGCGATCGAC CACGACGAGC TGCGTACCGG CAGCGCAGCC
TCCCCGGGGC CACCTCGGCC GTCCTGGTGC TACGGCAGTC CCGGCATCGC CCGAGCCGAA
CACCTCGCGG CCCTTGCCCT CGGCGACCAA CAACGAGCCT TCGATGCGGT CGAGACCCTC
ATCGGATGCC TCAGCGACGA CCACCAGCTC GCGCAGCTCA CCGACGCGGG ACTCTGCCAC
GGCTGGGCCG GCCTCCTCCT GACCGCTCAT CGGGCCGCCG CCGACACCAG CACCGGCGAA
CTCTCCGCCG CCCTGCACGC CGCCGAAACA CACAGGCACC GGTACCTCCG CGGCAACAGC
GACCCCACCG CCGCGGGTTT CTTAGACGGC GCGGCTGGCA TCGCACTCGC CCATGCCGCC
CTGAAGATCA CATCCGGCTC GGCGCTGCCA GACTGGGACC GCTGCCTGTT GATCAACTAG
 
Protein sequence
MTTLSRLPTV PDLGDAESAR WAQSLGDGAP GIALAHIARA RAGLDGWEPV HRLAAAMTRS 
PLNAHPDTAS LYQGAPAVAY ALHIAGHRAY DAALATLDEA IATVIRRRLE AAHRRIDHEQ
LPHAGEYDLI NGLTGLGALL LHHDRESALL RDVLAYLVRL TRPIRVDGRD LPGWWATGSP
DRRASARWNA GHAGFGMAHG IAGPLALLAI TMRRGIAVAG HVDALHNIIA WLDQWRRGQR
RTGWWPEAID HDELRTGSAA SPGPPRPSWC YGSPGIARAE HLAALALGDQ QRAFDAVETL
IGCLSDDHQL AQLTDAGLCH GWAGLLLTAH RAAADTSTGE LSAALHAAET HRHRYLRGNS
DPTAAGFLDG AAGIALAHAA LKITSGSALP DWDRCLLIN