Gene Franean1_3708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3708 
Symbol 
ID5672074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4390621 
End bp4392276 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content72% 
IMG OID641242591 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001508011 
Protein GI158315503 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAACG TCGCGAGCCT GCTGACCGCC TCGGCGCGGC GGCACCCGGA CCGGTGCGCG 
GTGCGGTTCG CCGGACGCCG GACGTCCTAC GGCGAGCTGC GCGAGCAGGC GGCCAGGTTC
GGCTCGGCGC TGCTCGGGCG CGGCCTGGAA CGCGGCGACC GGGTCGCGGT GCTGCTGCCC
AACTGCCCGC AGTACCTGGC CGTGCTGTTC GGGGCCTGGC ACGCCGGCCT GGTCGCCGTG
CCGATGAACG CCAAGCTGGC CGGCCCGGAG ATCCAGGTGA TCCTGGACGA CAGCGGCGCC
CGGGCGTTCG TCCACGCCGG CGCGGGCACA GTCGCCGGCC TCGATCTCAC CGGGGTCCAG
GAGGTGGTGG TCGACGTCCG CGGCGTCGGC GCCGGTGTCA GCGCCGACAC CGACACCGAC
ACCGGAGACG GTGCCGTTGC CGGCAGGACT GCCGGAGGCG GCCCGAGCGG GTTCGACCGG
CTGCTGGCCG AGGGCTCCGC CGAGCTGGTA CCGGTCGACG TCGCCGGGGA CGACCTCGCC
TGGCTGTTCT ACACGTCGGG GACGACCGGC CGCCCCAAGG GGGCCGAGCT CAGCCACCGC
AACCTCACCG TCACGACCTG GACGCTGCTC GCCGACGTGT GCGACTACCG GCCCTCGGAC
CTCGCCCTGC ACGTCGCGCC GCTGTCCCAC GGCAGCGGCC TGTACTCCCT GGGCGCGATC
GCCCGCGGCG CCGAGAACCT GATCCACGAC GGCGGCGGGT TCGACCCGGC CGAGGTACTC
GAGCTCGTCG CGCGCGAACG GATCACCGTC ATCGCCTTCC TGGTGCCGAC GATGATCGTG
AAGCTGCTCG GTGCCCCCGA GACGGACACG AGCTCGTTGC GCTGCGCCGT CTACGGCGGC
GCGCCCATCC ACGTCGAGCA CTCCCGCGCG ATGATCGAGC GGTTCGGGCC GGTGTTCGTG
CAGATCTACG GCCAGGGCGA GTCACCCATG ACCATCACCT ACCTCGACCA CGGCGCCTCA
CCCGACACCC CCCTCGACTC GGCGGGTGTG GCACACCCGG GGGTCGAGGT GCAGATCATG
GGCGCCGACG ACCGGCCGCT GCCCGCCGGC GAGGAAGGCG AGATCTGCGT CCGCGGGGAC
GTGGTGATGC GGGGCTACTG GAACAACCCG GAGGCGACCA GCCGGGCGTT GCGGGGCGGC
TGGCTGCACA CCGGTGACAT CGGCCGCCTC GACGAGCACG GCCGCCTGTT CCTCCTCGAC
CGCAGCAGCG ACGTCATCAT CTCCGGCGGG TCCAACATCT ACCCGCGGGA GGTCGAGGAG
GTGCTGATCC AGCATCCCGC CGTCGCCGAG GTCGTGGTCT TCGGCGTACC CGACGAGCTC
TGGGGCGAGA ACGTCGTCGC CGCGGTGGTG CCCGCGGCCG CGCCGCCCCC GGCGAACGAC
CTCATCGACT TCAGCCTCAC CCACATCGCC CGCTTCAAGA AGCCGAAGCA GATCATCTAC
GTCGACGCGC TGCCCAAGAG CTCCTACGGC AAGGTCCTGC GCCGGGAGGC CCGCCGGCTC
GCACTGGCGG CGGGCGAGAC AGCCGGCCAC GAGCACGTGA CCGGCCAGCC TGCCACCATC
AAGTCCGTTG ACCGCGATCC AGGAGCCGCC GAATGA
 
Protein sequence
MVNVASLLTA SARRHPDRCA VRFAGRRTSY GELREQAARF GSALLGRGLE RGDRVAVLLP 
NCPQYLAVLF GAWHAGLVAV PMNAKLAGPE IQVILDDSGA RAFVHAGAGT VAGLDLTGVQ
EVVVDVRGVG AGVSADTDTD TGDGAVAGRT AGGGPSGFDR LLAEGSAELV PVDVAGDDLA
WLFYTSGTTG RPKGAELSHR NLTVTTWTLL ADVCDYRPSD LALHVAPLSH GSGLYSLGAI
ARGAENLIHD GGGFDPAEVL ELVARERITV IAFLVPTMIV KLLGAPETDT SSLRCAVYGG
APIHVEHSRA MIERFGPVFV QIYGQGESPM TITYLDHGAS PDTPLDSAGV AHPGVEVQIM
GADDRPLPAG EEGEICVRGD VVMRGYWNNP EATSRALRGG WLHTGDIGRL DEHGRLFLLD
RSSDVIISGG SNIYPREVEE VLIQHPAVAE VVVFGVPDEL WGENVVAAVV PAAAPPPAND
LIDFSLTHIA RFKKPKQIIY VDALPKSSYG KVLRREARRL ALAAGETAGH EHVTGQPATI
KSVDRDPGAA E