Gene Franean1_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3601 
Symbol 
ID5671970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4261919 
End bp4263412 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content73% 
IMG OID641242487 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001507907 
Protein GI158315399 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGGAGC TGGTGGCAGA CCTGGACCGG CGCGCCGCGG CGGGCGCCAG GGGCGTCGTG 
GCGTGGGACG CCGATGGCGC CCACACCCTC GGTGCCGTGC TCGCCGCGGG CACACAGCTC
GCCGAGGCGC TGGCGGGGGC CTCGGGACCA GGGGCGACGG TCATGGTGCA GGCGGACAAC
TCGTGGCGCA CCGTCGCGGC GGCGGTCGCC GCCGCGAGGC TCGGAGGAGT CCTCGCGCTG
ATCAGCCGGC ACGCGACCGG CGTCGAATTC GTCCAGGCGT GCGAGGACCT GGACCCCGAC
GCCGTGGTGG CCGCCCCGGA CACCGCGCAG GGCTGGGCCG TCGCCGACAA GTACCCCGCC
CTCAGCGTTG ATGTGCTGGC CGGCTGGTCC GCAGCCGCCC GGCCTGCGCG GGGGTCAGCG
CGGTGGCGCG GCGGAGCGGT CATCGGGCTG ACCTCGGGTT CCACCGGCCG GGCCAAGGGC
GTCGTGCAGT CCGAGGCGGC GTTGCGGTAC GCGGGCCGCT GCACGATCGA TGCGGTCGGG
CTTCGCCCGG GCGACCCGGT GGCCGCGATG GTGCCCATGT CGTCCAGCGC TGCGTTCTGC
TTCGGGCTGT ACCTGCCGCT TTTGCTCGGC TCACCGATTG TCTTCTCCGA ACGGTGGGAC
CCGGCTGCCG CGGTGGCGCG GATGGCCGTG TTCGACGTGC GCTGGACAAT GTGCGTGCCG
ACGATGGCGC TGCAACTCGC TGCGGCCGGG CGGGCGGGGG AGCTGAGCCG GGTCCGGGCG
ATGACGGTCG GAGGCGGCCC GATGGACACC GGGGCGCTGG GCCGCGCCGA ACGGCATCTC
GGCACCAGGA TCCTGCGTGT CTTCGGAATG TCCGAATGTC TCGGCCACAC CACCCCCGCG
CTCGACGACC CCGAGGAGAT CCGGCTGGGA CGTGACGGCA GGCCGTTCCC GGGTACCGAG
CTGCGCGCCG TCGACGTCGA CGGAACGCCG CTGCCGCCCG GGGAGACGGG GCGGGCGCAG
GTCCGCGGGC CGTCCCTGTT CCCCGGCTAT GCCCGGGACG GTCGGACCGT CCCACCCGAA
CTGACTCCGG ACGGCTTCTT CGCCACCGGC GACCTGATCG TCGTGCACGG TGACGGGACG
GTGTCGGTCA GGGGCCGGGA GAAGGAGATC ATCATCCGCG GCGGGCGGAA CATCGACATC
GCCGAGGTGG AGCACGCGGT TGCCAGCCAC CCGGCGGTCG ACCAGATGTG CGTTGTTCCC
CTGCCCGACG ACGTGCTCGG CGAGCGCATC GGCGTACTCG TCGTGACCAC CGACGAGACA
CTGGAACTTC CCGAGCTAAC CAGGCACCTC GCCGAGCGAG GGCTGTCGAA GGCCAAGTGG
CCGGAGTTCC TGTTCAGGGT GCCGGCACTC CCGCAGAACC GGGTGGGGAA GCTTTCCCGG
GCCGAGGCCG CGCGGCTGGC CCAAGACCTG CACGCACGCG CAACCACCGG ATGA
 
Protein sequence
MQELVADLDR RAAAGARGVV AWDADGAHTL GAVLAAGTQL AEALAGASGP GATVMVQADN 
SWRTVAAAVA AARLGGVLAL ISRHATGVEF VQACEDLDPD AVVAAPDTAQ GWAVADKYPA
LSVDVLAGWS AAARPARGSA RWRGGAVIGL TSGSTGRAKG VVQSEAALRY AGRCTIDAVG
LRPGDPVAAM VPMSSSAAFC FGLYLPLLLG SPIVFSERWD PAAAVARMAV FDVRWTMCVP
TMALQLAAAG RAGELSRVRA MTVGGGPMDT GALGRAERHL GTRILRVFGM SECLGHTTPA
LDDPEEIRLG RDGRPFPGTE LRAVDVDGTP LPPGETGRAQ VRGPSLFPGY ARDGRTVPPE
LTPDGFFATG DLIVVHGDGT VSVRGREKEI IIRGGRNIDI AEVEHAVASH PAVDQMCVVP
LPDDVLGERI GVLVVTTDET LELPELTRHL AERGLSKAKW PEFLFRVPAL PQNRVGKLSR
AEAARLAQDL HARATTG