Gene Franean1_2941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2941 
Symbol 
ID5671327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3458621 
End bp3460942 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content67% 
IMG OID641241847 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001507267 
Protein GI158314759 
COG category[I] Lipid transport and metabolism 
COG ID[COG1022] Long-chain acyl-CoA synthetases (AMP-forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.227779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTCA GAGCCCACCG GCAGATAAGC GCTCCTGTCG ATGTCGTCTG GGATGTGGTC 
ACCGACCACG AGGGGATGAG TACCTGGTTC CCCGGTGTCG CCGTCGCCCT GGAACGTCCG
GGCGTGCAGG GCGGCATCGG TGCCGTCCGG ATCGTCCGGA TGACGGGCCT CAGCATCCGG
GAGCAGGTGA CGGACATCGA GCCCGCCCGC CGGCTCGCCT ACCGGTGCAT CTCCGGGCAC
CCCTTGCGGG ACTACCGCGG AGAGATCTTT CTCGCGCCGT CCGGGACCGG TACGAGCCTG
ACCTGGGTGC TCGACACCTC GACCCGTTTC CGTTTGGTCG CGTGGCTGCT GGTAAACCAG
GCGCGTCTTT TTGCCTGGTC TCTCGCCAGG GCCGCGGAGC GTTCATCCGC TCTCATCCGG
TCCGCCGACA CCGATCGGAC GAAGGAGCCC GCTGTGCAGC ACGTGCAGTC GACGGCGTCG
ACGCAGCAGA CGCCCGGCGC AGATTCCAGC CCCGGACCAG CTTCGCTGGC CGAGGCATTC
CAGCGCAATG CCAGTCGCGA CCCACAGGCG TTAGCCTTGA GCACCCCGGA CGGGTCAGCG
ACCCTCACCT GGGGGCAATA CGCCGAGCAG GTGCGCGACA TCGCGGCTGC CCTGCATGCC
CACGGCGTCC GGCGCGGCGA CAGCGTCGCC CTGATGATGC TCAACCGCCC CGAATTCTAT
CCCATCGACA CGGCCGCTAT CCATCTCGGC GCGATCCCGT TCTCGATTTA CAACACGTCG
TCCGCCGAGC AGATCCGGTG GCTGTTCGCC AGCGCGAAAC CGAGCATGGT CTTCTGCGAC
AGCAGCCACG CCGCTGCGGT GCTGGAGGCA GTCGACGGCG GCACCGCTGT CAAAGCTGTT
GTGTGCGTGG ACGGCGACGT CGAGGGGGCG ACGACCTCGG TGGAATTTCG GGGCGTCCGC
AGCGACGACT TCGACTTCGA GAGCACCTGG CGTTCGGTGA CACCGGACGA CGTTCTCACG
TTGATCTACA CATCCGGCAC GACGGGGGAG CCGAAAGGCG TCCAGATCAC GCACGGCAAC
ATGCTTGCGC AGCTCGCCGC GACCAACACC TTCCTGGAAG CGGGCCCGGG CGACAGAGTC
ATCTCCTTCC TGCCCTCGGC GCACATCGCG GACCGGTGGG CTGCGCACTA TCTCCAGCTG
GTCTGCGGGA CGACCGTCTA CCCGCTGGTC GACCGCACTC AGCTGCTCCC GACAATGCTC
CGCGTTCGTC CCACTCTGTT CGGCGCCGTG CCCCAGGTGT GGCAGAAGAT CCGTGCTGGG
GTGCTGGCGA TGATCGACGC AGAAGCCGAC GAGGAGCGGC AGGCGGGCAT CCAGCAGACC
CTGGCCGTCG GAGCCCGGTA CGCGCGAAGC CGCAGCGATG GCACCCTCAC GGCCGAGCTC
GAGAGCCTTT TCGCAACGGC CGACACGCAG GTGCTCAGCC ATCTGCGTTC CAGGCTCGGC
CTGGACCAGG CGCGGATCGT GATGTCTGGC GCGGCGGCGG TACCCGTGGA GATCGTCGAG
TTCTTCAACT CCATCGGGGT TCCGCTCATC GACGGGTGGG GGATGTCCGA GCTCTCCTGC
ATGGGCGCGT TCATGCCCAA CCACGCGCCG CGCCTGGGAT CGGTGGGCAT GGCTCTTCCC
GGTGTCCAGG TCCGCCTGGG CGAGGACGGC GAACTGCTCG TGCGCGGACC GATCGTGATG
AAGGGCTACC TCGGCCGACC CGAGCTGACT GCTGAGCTCA TTGACGACGA GGGCTGGCTG
TACACGGGCG ACGTCGCCCG CATCGACGAC GAAGGATACA TTTATATTAT TGATCGAAAG
AAAGAAATCA TTGTCAACTC CAGCGGAAAG AACATCTCGC CAGCGGGCAT CGAGGGTCAT
CTGAAAGCGG CGAGTCCCCT TATCGGCCAA GCTGTCGTGA TCGGTGAGGC GCGGCCCTTT
TTGACCGCGC TGATCGTGCT CGACGCGGAT GCCGCGGGCC AGTACGCGGC CTCCCGCGCA
CTCCCGGCGG ACGCGAGCTC GCTCGCCGCG GACGAGGGAG TCGTCGCCGC GCTCTCCGCC
GCCGTGACCG AGGCGAACAC CCACGTCTCG CAGGTCGAGC ACGTCCGGAA GTTCGCTGTC
CTCCCCCAGT TCTGGGAGCC GGGAAGCGAG CTGCTTACGC ACACGATGAA GCTGCGCCGC
AGGCCGATCG GCGCCCGCTA CGCCGACGTC ATCGAGGCGC TCTACCGGCT GCCGCGCGAC
GAGTCGGTCC TGCGTATCGC GGACGCGGCG TCCGCCTCCT AG
 
Protein sequence
MRVRAHRQIS APVDVVWDVV TDHEGMSTWF PGVAVALERP GVQGGIGAVR IVRMTGLSIR 
EQVTDIEPAR RLAYRCISGH PLRDYRGEIF LAPSGTGTSL TWVLDTSTRF RLVAWLLVNQ
ARLFAWSLAR AAERSSALIR SADTDRTKEP AVQHVQSTAS TQQTPGADSS PGPASLAEAF
QRNASRDPQA LALSTPDGSA TLTWGQYAEQ VRDIAAALHA HGVRRGDSVA LMMLNRPEFY
PIDTAAIHLG AIPFSIYNTS SAEQIRWLFA SAKPSMVFCD SSHAAAVLEA VDGGTAVKAV
VCVDGDVEGA TTSVEFRGVR SDDFDFESTW RSVTPDDVLT LIYTSGTTGE PKGVQITHGN
MLAQLAATNT FLEAGPGDRV ISFLPSAHIA DRWAAHYLQL VCGTTVYPLV DRTQLLPTML
RVRPTLFGAV PQVWQKIRAG VLAMIDAEAD EERQAGIQQT LAVGARYARS RSDGTLTAEL
ESLFATADTQ VLSHLRSRLG LDQARIVMSG AAAVPVEIVE FFNSIGVPLI DGWGMSELSC
MGAFMPNHAP RLGSVGMALP GVQVRLGEDG ELLVRGPIVM KGYLGRPELT AELIDDEGWL
YTGDVARIDD EGYIYIIDRK KEIIVNSSGK NISPAGIEGH LKAASPLIGQ AVVIGEARPF
LTALIVLDAD AAGQYAASRA LPADASSLAA DEGVVAALSA AVTEANTHVS QVEHVRKFAV
LPQFWEPGSE LLTHTMKLRR RPIGARYADV IEALYRLPRD ESVLRIADAA SAS