Gene Franean1_2862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2862 
Symbol 
ID5671251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3376783 
End bp3378369 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content66% 
IMG OID641241771 
Productacyl-CoA synthetase 
Protein accessionYP_001507191 
Protein GI158314683 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGGTA CCGTTGAGCC GGCCGCGCAC CCTGGTTCGC TCCACCGGCC CGCAACGCAG 
GCCGACCTGC TCGTCAGGTC CCTGACGCGC GATCGGTCCC GACCCGTGCT GTACATGGGC
GATGATGTCC TGACCGCCGG ACAGTTCGCT GATGAGATCA GCCGCTACGT CCAGGCCTGG
CAGGACTGGG GAATCACCGT GGGCAGCGGG GTGGCGATTC TGTCGCCCAA CCGGCCCGAG
GTGCTCATCT CGATGGGCGC CGCGCTGGTG GCGGGAGTGG TATGGACGCC GCTACATCCG
CTCGGTTCGC TTGAAGATCA GGCGTTCATC CTTGCGGATG CGGGCGTCGA GACGCTGCTC
TTCGACCCGG TCGCTTTCGG CGATCGGGCC GCGCAGCTCG GCGAGCGGGC CCCATCCCTG
AAGCGGCTCA TCGCGCTCGC ACCCACCAAT GACGCCGAGG ACATCGCCAC CAGGGCGGCA
CAGTTCGGCG CGCGCCACCT GTCCGCCCCG AACGTGCGGC TGTCAGATCC GTCCTGGATC
GTCTATACCG GCGGTACCAC GGGTCGTCCC AAGGGAGTGG TCACGACGCA CCAGGGCATC
GCGACCATGA CCGACATTCA GATGGCCGAG TGGGATTGGC CACGCGAATT GCGGACACTG
TGCGTAACCC CGCTCAGTCA CGCTTCGTCG GCCCTCTTCC TGCCCACCGT GCTGCGCGGT
GGGTCGCTGG TGGTCACCTC GTCCTTTGAT CCCGATCAGT TCCTCGTCTT GATCAAGCGC
TACCGGATCA CGGCGACATT CCTCGTGCCC ACAATGATCT ACCGCCTGCT CGACCACCAC
CGCCTCCGAA CGGCGGATCT CAGCAGTCTC GAGACGCTGT TCTACGGTGC ATCCGCTATG
TCACCCAGCC GGCTCGCCGA AGCCATGGAG ACGTTGGGCC CGATCTTCTT CCAGTTCTAC
GGCCAGGCGG AATGCCCCAT GACGGTGACC GTCCTACGGA AGAAACAGCA CCATCCGGAC
CGGCTCGCGT CCTGCGGGCA GCCGGTGCCC TGGCTCGATG TCGCCCTGCT CGACGACGAC
GGCCGCGAGG TGGATACCGG CGAACCCGGT GAGATCTGCG TCCGGGGGCC GCTCGTCATG
GCCCGCTATC ACAACCAACC AGACCAGACT GCCGAGGCGT TCCGGCACGG TTGGCTGCAC
ACCGGAGACA TCGCAACCGC CGATCACGAT GGTTTCCTCA CTATCGTCGA CCGGAAAAAA
GACATGATCG TTACCGGTGG GTTCAACATA TTTCCGCGGG AAGTGGAGGA CGTTCTCGCC
ACCCACCCGG AAGTCTCCGC GGCCGCGGTC ATCGGCGTAC CGGACCCGAT CTGGGGTGAG
GCGGTGAAGG CCGTCGTCGT CCGCTGCCCG GGTGCATCCG TGCGCGCGGA GGACCTGGTG
CGGCTCGTCA AGGAACGCAA GGGGCCCGCC GCCGCGCCGA AATCAGTCGA CTTCGTCGAT
ACCATCCCGC TCAGCCCGCT GGGAAAGCCC GACAAGAAGG CGCTACGAGC AGCACACTGG
ACCGGGTCCG CCCGAAACGT CAGCTAA
 
Protein sequence
MPGTVEPAAH PGSLHRPATQ ADLLVRSLTR DRSRPVLYMG DDVLTAGQFA DEISRYVQAW 
QDWGITVGSG VAILSPNRPE VLISMGAALV AGVVWTPLHP LGSLEDQAFI LADAGVETLL
FDPVAFGDRA AQLGERAPSL KRLIALAPTN DAEDIATRAA QFGARHLSAP NVRLSDPSWI
VYTGGTTGRP KGVVTTHQGI ATMTDIQMAE WDWPRELRTL CVTPLSHASS ALFLPTVLRG
GSLVVTSSFD PDQFLVLIKR YRITATFLVP TMIYRLLDHH RLRTADLSSL ETLFYGASAM
SPSRLAEAME TLGPIFFQFY GQAECPMTVT VLRKKQHHPD RLASCGQPVP WLDVALLDDD
GREVDTGEPG EICVRGPLVM ARYHNQPDQT AEAFRHGWLH TGDIATADHD GFLTIVDRKK
DMIVTGGFNI FPREVEDVLA THPEVSAAAV IGVPDPIWGE AVKAVVVRCP GASVRAEDLV
RLVKERKGPA AAPKSVDFVD TIPLSPLGKP DKKALRAAHW TGSARNVS