Gene Franean1_3965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3965 
Symbol 
ID5672326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4747678 
End bp4749519 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content77% 
IMG OID641242844 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001508261 
Protein GI158315753 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.666614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.454806 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGACG CTCCGGCCGA GGTGGCGATC CCGGCGCCCC GCGCCGTCCC CGTTTCCTCA 
CCCGTCGATC TCGTACCTGC CGATCTCGTA CCTGCCGATT CCGTGGCCGC CGGCTCGACG
GCCGCCGCCG CGCTCGCGGC GGGCACCCTG CACGCCGGCC TGGCCACGGT GGCGGCCCGC
CATCCGGGGT TGCCTCTCGA CTTCCCCTCC GCCGGAGCGT CGCTGACGCT GGGCGAGCTC
GTCGCCCGCG CGGACGTCCT GGCGGCAGCC CTGACCGGTG CCGGGGTCGT CGCCCGTGAC
CGGGTCGGAG TGCTCAGCGA CAACGCGCCC GACTTTCTGG TGGCCCTGGC CGGAGTGAGC
CGGGCGGGAG CCGCCGCGTG CCCCCTGCCG CTGCCCGCCT CCACCCGTGA CCTGCCCGGC
TACGCCGCGC GGCTGGCGCG CGCGGTCGCC GTCGCCGACA TCCGGCTCGT GCTGGTCGGC
GGCCGGACGG CCCGGATGGC CGACCGCTTC GCCGGGGCCT TCGACGGCGT CCGCCTCGTC
CGGGTCGCCG ACCTCACCAC ACCGGCCGCC GCCGGCACTG TCCCGGCCAC CGGCACCGCG
CCAGCCGCCG CTGGCGGTGC GGCCCCGGCC GGCCCGGCGG TGGAGGTGTC ACCCGACGAG
GCCGCCCTGG TCCAGTTCAC CTCCGGTAGC ACCGCCGCCC CGAAGGGCGT CGTACTGACG
CACCGCAACA TCCTGGCCGG GCTGGCCGCG ATCATCGGCG GCGTCGCGCT GACCGAGGTC
GACCACGGCG GCATCTGGCT GCCGCTCTTC CACGACATGG GCCTGTTCGG CACGCTCGCC
GGCATCTTCA CCGGCATGCC GATGACCGTC TGGTCACCGG CCGCCTTCGT GAAGGACCCG
GCCGGCTGGC TGACAGACTT CCTCGGCCGC GGCGGCAGCA TCGCCCCCAT GCCGAACTTC
GCCTACGACC ACCTGGCGGA GGCCGTCCCC GCGCCGCGGG AGGCCGGCCT GGACCTGAGC
GGCTGGCGGG TCGCCTTCAA CGGCGCCGAG CCGGTCGAGC CCGCCTCGGT CGAGCGCTTC
CTCACCACCT TCACCCCGGC CGGCTTCGCG CCGGCGGCGA TGATGCCCGT CTACGGGATG
GCCGAGGCCA CGCTGGCGGT GACCTTCCCG CCACCGGGCC GCGCCCCCGT GCACCGCTGG
GTCGACCGCG ACCTGCTCGC CCGCGACGGC GTCGCGCGCG ACGTGCCCGC CGGCTCGCCG
TCCGCCCGCG GGCTGGCCGG GGTGGGGCGA CCGGTGCGCG CCATGCGGGT GCGGATCGGC
GGCCGCGACG GCACCGGCGT GCTCGGTGAC GACCAGGTCG GCGAGATCCA GATCAGCGGC
GACGCGGTGA CGGGCGGCTA CCTGACCGAC ACCGGCGCGC AGCCGTCCGG CGCGTTCACC
GCGGACGGCT GGCTGCGCAC CGGCGACCTT GGCCTGCTGC GCGACGGCGA GCTCTTCGTC
ACCGGCCGGG ACAAGGAGAT GGTGATCGTC CGCGGGGTGA ACTACTACCC CCACGACGCC
GAGGAGGCCG CCCGGGACGT CCCCGGCGTC CACCGCCGCC GCTGCGTCGC CTACGCGGAC
CGCTCACCCG GGGGCGCCGA GACGATGGCA GTCCTCGCCG AGACCCGGCT GGTCGACGAC
ACCGAGCGCG CGGCGCTGGC CGCCGCGATC CGGGTGGCGG TGACCGCCGC GCTGGGGCTG
GCCGAGATCG CCGTCGCCCT CGTCGGGCCC GATGCCCTGC CGCGGACGTC CAGCGGGAAG
TTCCAGCGCC TCGCCGCGCG CGAGGCATGC GTACCGACAT GA
 
Protein sequence
MHDAPAEVAI PAPRAVPVSS PVDLVPADLV PADSVAAGST AAAALAAGTL HAGLATVAAR 
HPGLPLDFPS AGASLTLGEL VARADVLAAA LTGAGVVARD RVGVLSDNAP DFLVALAGVS
RAGAAACPLP LPASTRDLPG YAARLARAVA VADIRLVLVG GRTARMADRF AGAFDGVRLV
RVADLTTPAA AGTVPATGTA PAAAGGAAPA GPAVEVSPDE AALVQFTSGS TAAPKGVVLT
HRNILAGLAA IIGGVALTEV DHGGIWLPLF HDMGLFGTLA GIFTGMPMTV WSPAAFVKDP
AGWLTDFLGR GGSIAPMPNF AYDHLAEAVP APREAGLDLS GWRVAFNGAE PVEPASVERF
LTTFTPAGFA PAAMMPVYGM AEATLAVTFP PPGRAPVHRW VDRDLLARDG VARDVPAGSP
SARGLAGVGR PVRAMRVRIG GRDGTGVLGD DQVGEIQISG DAVTGGYLTD TGAQPSGAFT
ADGWLRTGDL GLLRDGELFV TGRDKEMVIV RGVNYYPHDA EEAARDVPGV HRRRCVAYAD
RSPGGAETMA VLAETRLVDD TERAALAAAI RVAVTAALGL AEIAVALVGP DALPRTSSGK
FQRLAAREAC VPT