Gene Franean1_2695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2695 
Symbol 
ID5671086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3189088 
End bp3190638 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content66% 
IMG OID641241607 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001507027 
Protein GI158314519 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCCGG GCTTCTACGC CGAGAAGGAC CCCGCCAAGC CGGCCGTAGT GCTCTGCCCG 
GCCGGAGAGC GGGTCAGCTA CGGATCGCTG GAGGCACGGT CCCGCCAGTT CGCCCGCGTG
CTCCGAGCCC GCGGACTGCG GCCCGGCGAC ACGGTGGCCC TCCTGGCCGA GAACCACGCG
CGCTACCTGG AGGTGTACTG GGCAGCGATC CGCTCCGGGC TCTACCTGAC GGCGGTCAAC
TGGCACCTGA CCGCGGCCGA AGCCGCCCAC CTGCTCGGTG ACTCCGCAGC ACGCGTACTC
GTCACTACCG CCCGGTTCAC CGACCTGGCC CGCACGGCCG CGGATCTCAG CCCGACCTGC
TCAACGCTCC TCCTCCTGGA CGGGACCGAG GACGGCTTCG AATCGTACGA GGAAGTGATC
GCGGCCCAGT CCGCCGCACC GCTCGCCGAC CAGCCAGCCG GCGACGTCAT GCTCTACTCC
TCCGGCACGA CCGGACGCGC CAAAGGCATC CGACGCCCGC TGTCCGACCT GCAGGTGGAC
CAGCCCGGCC GCCCCAGTGC CTCTCCGATG GCAAAGGCAT TTCTCGGAAT CGGCGAGGAC
TCGACATACC TAACCCCGGC GCCGCTGTAC CACGCAGCTA GCCTGCACTG GGCAGCCGGC
GCCCACGAGC TCGGCGCGAC ACTCGTCATC ATGGACCGCT TCGACGCCGA ACAGATGCTT
GCCGTTATCG AAAAAGAACG AGTCACCCAC GCCCAAGTCG TCCCCACGAT GATGATCCGC
CTACTGAAAC TCCCGGCCGA AGTACGAACG AGATACGACG TCTCCAGCCT CCGCTCATTG
ACACATGCGG GAGCACCCTG CCCCCCGGCC ATCAAACGTC AGATGATCGA CTGGCTCGGC
CCGATCGTCG ACGAGTACTA CTCCAGCACT GAAGGCTCCG GTATGACGTT CATCGGCTCC
GCCGACTGGC TGGCACATCC GGGATCTGTC GGCAGAACAA TCATCGGCAC CCCGCACATC
TGCGACGACA ACGGTAGGGA GCTACCGGTA GGCGAGCCCG GGCTGCTGTA CTTCGACCGG
GGGACGGAGC ACTTCGAATA CCACAACGAC CCCGAAAAGA CTCGCGAGGG CCGCCACCCC
AAGCACCCGA CCTGGACGAC CTCCGGAGAC ATGGGCTACG TCGATACCGA CGGCTACCTA
TACCTGACGG ACCGCAAAAG CTTCATGATC ATATCCGGAG GGGTCAACAT CTACCCCGCC
GAGATCGAGG CCGCCCTCAT CCTGCACCCC GCCATCACGG ATGTCGCCGT CTTCGGCCTT
CCGCACGCCG ACATGGGCGA ATATGTCCAC GCCGTCGTTC AGCCCACGGA CGGCGTCGAC
GCCACACCCG AACTCGCCGA GCAAATCCGC GCGTTCGCCC GCGACCACCT CGCCGGCTAC
AAGGTCCCCC GAGCAATCAC CTTCCGCGAC CAGCTACCGC GCATGTCCAC CGGCAAACTC
GCCAAGAACG CCCTGCGCCA GGAATACCTC GGTGCTGCGC TACCGCGGTA G
 
Protein sequence
MYPGFYAEKD PAKPAVVLCP AGERVSYGSL EARSRQFARV LRARGLRPGD TVALLAENHA 
RYLEVYWAAI RSGLYLTAVN WHLTAAEAAH LLGDSAARVL VTTARFTDLA RTAADLSPTC
STLLLLDGTE DGFESYEEVI AAQSAAPLAD QPAGDVMLYS SGTTGRAKGI RRPLSDLQVD
QPGRPSASPM AKAFLGIGED STYLTPAPLY HAASLHWAAG AHELGATLVI MDRFDAEQML
AVIEKERVTH AQVVPTMMIR LLKLPAEVRT RYDVSSLRSL THAGAPCPPA IKRQMIDWLG
PIVDEYYSST EGSGMTFIGS ADWLAHPGSV GRTIIGTPHI CDDNGRELPV GEPGLLYFDR
GTEHFEYHND PEKTREGRHP KHPTWTTSGD MGYVDTDGYL YLTDRKSFMI ISGGVNIYPA
EIEAALILHP AITDVAVFGL PHADMGEYVH AVVQPTDGVD ATPELAEQIR AFARDHLAGY
KVPRAITFRD QLPRMSTGKL AKNALRQEYL GAALPR