Gene Franean1_1820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1820 
Symbol 
ID5670222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2184314 
End bp2186128 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content71% 
IMG OID641240741 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001506164 
Protein GI158313656 
COG category[I] Lipid transport and metabolism 
COG ID[COG1022] Long-chain acyl-CoA synthetases (AMP-forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.591607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGAGT ACACCTCTCC CGCCCGAGTC ACCGTGGCGG ATGACATGAC TCTCAGCGAT 
GCCGTATTCG CCAACGCGTC ACGGACACCC GAGAAGGTCA TCGTCCGCCA CAAGTCGGGT
GGCCAGTTCG TTGACGTGAC CGCGCAGGAG TTCCGCGATC TGGTGGTCCG CACGGCCGCC
GGCCTCGCCG CCCGCGGGGT GCACCCCGGC GACCGCGTCG CGATCATGAG CCGCACTCGG
TACGAATGGA CGGTCGTCGA CTACGCGGTC TGGGTAGCCG GCGCGGTCAC CGTGCCGATC
TACGAGACGT CCAGCGCCAG CCAGCTCGAG TGGATCCTGT CGGACTCGGA GGCCGTTCTC
ACCGTCGTCG AGTCCGAGGC CAACGCCGCA CTCGTCGCCA CGGTGCGCGA CCAGGTTCCC
ACCCTGCGCG AGGTGCTCGC CCTGGACGGC GGCGCGCTCG ACACCCTGGC CGAGGCCGGG
GCCGCCGCGG GGACCGGCGA GGAGGAGCTC GCCGCCGCGC GGGCCGGGGT GACCGCGGCC
AGCATCGCCA CGATCATCTA CACCTCCGGC ACCACCGGCA GACCGAAGGG CTGCGAGCTC
GCCCACCGCT CGCTGCTGTT CAACGCGATG AGCTCCGCGG CGACCATGCC GGACCTGTTC
ACCGAGGACG CGTCCACGCT GATGATCCTG CCGCTGGCCC ATGTGCTGGC GCGGACGATG
CAGTGCACCA TCATCAACAG CGCGCGGTGC ATCGCCTACG CGCCGGACAC ATCCACGCTG
CTGGCCGACC TGGCCCAGGT CCGCCCGAGC TTTCTGCTCG CGGTGCCGAG GGTCTACGAG
AAGGTGCACG CCGGCGCCCG CGCCAAGGCC CACGCCGACG GGAAAGGCCG GATCTTCGAC
GCCGCCGAGG CCACCGCGAT CGCCTACAGC GAGGCCCTCG ACCACGGCGG GCCCGGCTTC
CTGCTGCGCG CTCGGCACGC GTTGTTCGAC CGCCTCGTCT ACAGCAAGCT GCGTGCCGCG
ATGGGCGGCC GGATCGACCA CGCCATCAGC GGCGGTGCGC CGCTGGGCCC ACGCCTCTGC
CACTTCTACC GGGGCATCGG CGTTCCGATC TTCGAGGGGT ACGGGCTCAC CGAGTCCACC
GCGGCCGCGA CCGTGAACCG GCCCGATTCG CTGAAGATCG GCACCGTCGG CCTGCCGCTG
CCCGGCGTGA CGATCCGCAT CGCCGACGAC GGGGAGATCC TCATCCGCGG CGACCTGGTC
CTGAGCGGCT ACCGCAACGA CGAGACGGCG GCCAAGGAGG CGCTCGACGC CGACGGCTTC
CTGCGGTCCG GTGACCTGGG CTCCCTCGAC GAGACCGGAC ATCTGCGCAT CACCGGGCGG
AAGAAGGAGC TGCTCGTCAC CGCGGGCGGC AAGAACATCG CCCCGGCGCC CCTCGAGCAC
CGCATCCAGG AGAACCCGCT GATCAGCCAG GCGATGCTGA TCGGCGACCA GCGGCCGTTC
ATCGCCGCCC TGATCACCCT CGACCCGGAC GCCTTCGCCT CCTGGCGCGA CACCCACGGC
CACCCGTCGA CGGTGACACC CGCCGACCTC GCCACCGACC CGGAACTGCT GGCCGAGGTC
CAGAAGGCCG TGGACGCGGC GAACGCCACC GTCTCCCACG CCGAGTCGAT CAAGAAGTTC
GTGATCCTGC CGAACGACTT CACCGTCGCC GGCGGCGAGC TCACCCCGAG TCTCAAGGTG
AAGCGCAACC TCATCTTGGA ACGCCACGCG GCAGTCGTGG AGTCCATCTA CGCGGGGGCC
CGGACCGCCT CCTGA
 
Protein sequence
MREYTSPARV TVADDMTLSD AVFANASRTP EKVIVRHKSG GQFVDVTAQE FRDLVVRTAA 
GLAARGVHPG DRVAIMSRTR YEWTVVDYAV WVAGAVTVPI YETSSASQLE WILSDSEAVL
TVVESEANAA LVATVRDQVP TLREVLALDG GALDTLAEAG AAAGTGEEEL AAARAGVTAA
SIATIIYTSG TTGRPKGCEL AHRSLLFNAM SSAATMPDLF TEDASTLMIL PLAHVLARTM
QCTIINSARC IAYAPDTSTL LADLAQVRPS FLLAVPRVYE KVHAGARAKA HADGKGRIFD
AAEATAIAYS EALDHGGPGF LLRARHALFD RLVYSKLRAA MGGRIDHAIS GGAPLGPRLC
HFYRGIGVPI FEGYGLTEST AAATVNRPDS LKIGTVGLPL PGVTIRIADD GEILIRGDLV
LSGYRNDETA AKEALDADGF LRSGDLGSLD ETGHLRITGR KKELLVTAGG KNIAPAPLEH
RIQENPLISQ AMLIGDQRPF IAALITLDPD AFASWRDTHG HPSTVTPADL ATDPELLAEV
QKAVDAANAT VSHAESIKKF VILPNDFTVA GGELTPSLKV KRNLILERHA AVVESIYAGA
RTAS