Gene Franean1_3615 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3615 
Symbol 
ID5671983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4285362 
End bp4287029 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content73% 
IMG OID641242500 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001507920 
Protein GI158315412 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCCG GGTGCACCCC GTGGCCGGAG GAGGTGGCTC GCGGCTACCG GGAGAAGGGC 
ATCTGGCGGG GGCAGACGAT GGGCGCGCTG CTGGCCGATC TCGCCCGCCG CCACCGCGAC
AGCACGGCGC TGATCCACCG CGACCGCCGG ATCAGCTACA CCGAGCTGGA CGCCTGGGCC
GACCGGCTGG CCGCCGGGTT CGCCGCGCAC GGTGTCGCGC GCGGCGAGCG CGTGGTGGTG
CAGCTGCCGA ACACGCCGGA GTTCATCGCG ATCGTGTTCG GGCTGTCCCG GATCGGCGCC
GTCCCGGTGT TCTCGCTGGT CGCGCACCGG GCGACCGAGC TGACCCACCT GGTGCGGCTG
TCCGGGGCCA CCGGATACGT GCTGCCCGAG TCCTACCGCG GCGTCGACCA CCTCGCCCTG
GCCCGGCAGC TCCGGGCGGC GACCGACACG CTGCGGACGA TGTTCGTCCT CGGTGACGCC
GCCGACGGCT TCGTCGCCCT CTCAGCGGTC GAGGCCGCCG GCGACGTCGG CCGCGTCGGC
GCCGGCATCG CCGCGTACGA GGCCCCGCGG GAGCCCATGC CGCCGGCCGC CGACCCGTCC
GACGTGGCGT TCTTCCTCCT CTCCGGCGGG ACGACGGCGT TGCCGAAACT GATCCCGCGC
ACCCACGACG ACTACGTGTA CCAGTCCGAG CTGGCCGCCC AGGTGTGCGA GATGTCCGCC
GATGACGTCT ACCTGGCCGC GCTGCCCGTC GAGTTCAACT TCGCCTTCGG CTGCCCGGGA
GTGATCGGCA CGCTGCAGAC CGGCGGGACG GCGGTGCTCG CCGACACTCC GAACCCGCTG
GACTGCTTCC TGCTCGTGGA ACGGCACGGC GTGACGGTGA CCGCGATGGT CCCCTCCGTC
GTGGCGCTGT GGCTGGACGC CGCCGAGTGG AACGACGCGG ACCTGTCGAG CCTGCGCCTG
GTCCAGGTCG GCGGCGCCCG GATGACCCGC GAGTTCACCG CCCGCATCGG GCCTGGCCTG
GGCTGCTCGC TCCAGCAGGT GTTCGGGATG GCGGAGGGCC TGCTCTGCTT CAGCCGCCCC
GACGACCCGG CCGAGGCGGT GCTGACGACG CAGGGCCGCC CGATCTCGCC CGCTGACGAG
GTGCTCATCG TCGGGCCGGA CGGCGACCCG CTGCCCGGCG GCGAGATCGG CGAGCTGGTC
ACCCGTGGTC CGTACACGCT GCGCGGCTAC TACCGGGTGC CGGAGTACAA CGCGCGGGCG
TTCACCCCGG ACGGCTTCTT CCACACCGGT GATCTCGCCC GGCTGACCCC GGCCGGCGAC
CTGGTGATCG AAGGCCGGAT CAAGGAAATG ATCATTCGGG GCGGGGACAA GATCTCGGCC
GGCGAGGTCG AGGACCACCT GCTCGCCCAC CCCGGCGTCA CCGCGGCGGC CGTGACCGCC
GTCCCCGACG ACCTGCTCGG TGAGCGGATC TGCGCCCACC TGATCGTCGA CGGGCCGGCC
CCGTCGCTGG CCGAGCTCAA GCGGGCCATG CACGCGCGCG GCGTCGCCGA CTACAAGCTG
CCCGACGCCG TCCGGTTCGT GACCGAGTTC CCGCTCACCC CGCTCGGGAA GATCGACAAG
TTGGCGCTGG CCGCGGCGGC CGCGTCCGAA CGGAAGGCTG ACGTGTGA
 
Protein sequence
MLAGCTPWPE EVARGYREKG IWRGQTMGAL LADLARRHRD STALIHRDRR ISYTELDAWA 
DRLAAGFAAH GVARGERVVV QLPNTPEFIA IVFGLSRIGA VPVFSLVAHR ATELTHLVRL
SGATGYVLPE SYRGVDHLAL ARQLRAATDT LRTMFVLGDA ADGFVALSAV EAAGDVGRVG
AGIAAYEAPR EPMPPAADPS DVAFFLLSGG TTALPKLIPR THDDYVYQSE LAAQVCEMSA
DDVYLAALPV EFNFAFGCPG VIGTLQTGGT AVLADTPNPL DCFLLVERHG VTVTAMVPSV
VALWLDAAEW NDADLSSLRL VQVGGARMTR EFTARIGPGL GCSLQQVFGM AEGLLCFSRP
DDPAEAVLTT QGRPISPADE VLIVGPDGDP LPGGEIGELV TRGPYTLRGY YRVPEYNARA
FTPDGFFHTG DLARLTPAGD LVIEGRIKEM IIRGGDKISA GEVEDHLLAH PGVTAAAVTA
VPDDLLGERI CAHLIVDGPA PSLAELKRAM HARGVADYKL PDAVRFVTEF PLTPLGKIDK
LALAAAAASE RKADV