Gene Franean1_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1998 
Symbol 
ID5670399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2399586 
End bp2402519 
Gene Length2934 bp 
Protein Length977 aa 
Translation table11 
GC content77% 
IMG OID641240919 
ProductCoA-binding domain-containing protein 
Protein accessionYP_001506341 
Protein GI158313833 
COG category[C] Energy production and conversion 
COG ID[COG1042] Acyl-CoA synthetase (NDP forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0769955 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTCGGC GACCGGCTCG TCGGACATCC CCCCGCGGGC ATCCGTCGGC CGCCACCTGG 
TCTGCGCCAT TGTGCAATCC AATGCCGGAC GGTTACCGAC ACGGGGTCGA GCGACGGGCC
CGCGGCGGAC GACGGAGGCG GCGCGGTCCG GCGCGGAATA GCGTCCACGG GGTGACATCT
CGCTACCCGG CGCACTGGGC GGCCGACGGC GTTCTCGCCG ACGGCGCGCC CGTTCAGCTG
CGCCCCGTGC TGGGCACCGA CGCGCCGGGT CTGGCCGAGC TGCGGTCCGG CCTGTCCGCG
GCGGACGTCG CGCGCCTGCC CGCGCGGTGG GCGCGGCGCT CCCCGGAAGA GCTCGCCGGG
CACCTGACCG CGGCCGGCGA CCCGGCCGCC GCCGGTCACC CCACCATCAC CGGTCACCCC
ACCACCGGTG ACAGCGGGCG GCTCGCCGTC GCGGCCGTGC TGCGCGGACG TCTGGTCGGC
ACGGCGGACT ACGAACGGAT CGCCGGCTCC GACGACGCGG TGGTCGCCCT GGTCGTCGAG
GCTGCGCACC GCGGCCGTGG GCTCGGACTG CTGCTGCTCG AACACCTGAT CGCCGCCGCC
CGCGAGCGCG GGGTGAGCCA CCTCGTCGCC GATCTGCGCG CGGGCGACGA CCGGGCGCTG
CGGGTCTTCC ACGCCGCGGG CTTCGCCGGC GCCGAGACCC GCCCGCGCAC CCCCCGGCAG
GACGGCGCTG GTCAGGACGG CGCTGGTGGC GTGCGGGTGG TGTTCCCGAC GGCACAGACT
CCGCGCACCC GGGGCATCTC CCGGGCGCTG GAACAGCGGG CGGAGGCACG GAGCATCGCG
CGGCTGCTCA CACCGCGCGC GGTCGCGGTC GTCGGCGCGA GCCGGCAGCC CGGCAGCGCC
GGCCACGAGG TCTTCCGCCG GCTGCTGGCC AGCGATTTCC ATGGCCCGGT CTACCCGGTC
AACCCGGCGG CGCGCCAGGT CGCCTCGGTC TACGCCTACC CGGACGTCCG CGAGATCCCG
GACGCGGTCG ACCTCGCCGT GATCGCCGTC CCGGCGCCGG CCGTGGCCGA CGCGGTGCGG
GCCTGCGCGG AGAAGGACAT CCGTGGCCTG ATCGTCGTCT CGGCCGGGTT CGCGGAGGCC
GGGCCCGACG GGCGGGCCCG GCTCGCCGAG GTCACCCGGC TGGCCCGGGA GTCCGGCATG
CGGCTGATCG GCCCGAACGC GATGGGCGTG ATCAACACGG ACCCGGCCGT CCGCCTGCAC
GCCACCTTCG CGGCCGGCGA CCCGCCGGTG GGAAGGGTCG GCGCGTTCAC CCAGTCGGGG
GCGCTGGCCG GGACGTTCCT CACCGAGGCG TCGCGGCGCG CGATCGGCCT GTCCACGTTC
GTCTCCACCG GTGACCGCTC GGACGTCTCG GCCAACGACG TGCTGCAGTA CTGGCAGTCG
GACCCGCACA CCGATGTGAT CATGCTGCAC CTGCAGGGGT TCGGCAACCC GCGGAAGTTC
GCCCGGATCG CCCGGCGGGT GGGCCGACGC AAGCCCGTGA TCGCGCTGAA GAGCGGGCGC
AGCGCCGCCG ACCCGGCCCT GGACGCCCTG TTCACCAGCG CCGGGGTCAT CCGGGTGGAC
ACGTTGAGCC AACTGTTCGA CCTGGCCGCG CTGCTGGCGT CCCAGCCGCT GCCCGCCGGG
CGGCGCGTCG GCGTCGTGGG GACGTCCAGC GCGCTCGCCG CCCTGGCGAC GGACGCCTGC
CGGACGGCAG GCCTGGAGGT ACCGCCCTTC TCCACCGCCA CGGCGGAGGC GTTGAGCGAC
ACGCTCGGCC GCCCGGAGCC GGCCAACCCG GTGGACCTGG GCGCGATGGC CGCGCCCGAA
CGGTTCGAGC GCGCGCTGCG CGCGGTCGCC GCCAGCGCGG ACGTCGACGC CGTGCTGGCG
CTGATCACCC CGCACCCGGC CGTCGAGGAG CTCGCGCGGG CCGTGCGGGC CGTGGCGGGC
TCCGGCCGGG TGCCCGTGGT GGCCTCCTAC CTCGGGTACG ACGGGATGCC GTCCGCGCTG
GCCGCCCCGG GCGACGGCAT CGTGACGCCC GCACCCGGCT CCGTGCCGTC GTTCGCCTCC
CCGGAGTCGG CCGCGCTCGC GCTCGCCCGG GCGGCGGGCC ACGCGGCGTG GCGCAGCCGC
GAGCAGGGCG CCGTCCCCAC TCTCGACCGG CTCGACCTGG ACCGCGCGCG CCGCGCGGCG
GCCGCCGGCC CGACGGACGG GACGTGGCTG CCCCAGGAGC TGGTCGGCGA CATCCTGGGC
GGGGTGGGGC TGGCGGTCTG GCCCAGCGAG CCGGTGACGA GCGCCGCCCA GGCCCTGGAC
ACGGCCGAAC GGCTCGGCTG GCCCGTCGCC CTGAAGATCG CCGACGAACG CTTCCGTGGG
CGGCTGGACG TCGGAGCCGT CCGGCTGGGC GTCGAGGGGC CCGGCGCGCT CGCGGAGGCC
TGGCGCACGA TCCGCGCCGC GGTCGGGCCG GGCGACATGG TCGTCCAACC GATGGCGCCG
GCCGGGGTGT CGACCGTGAT CCGGATGACC CAGGACCCGG CGATCGGGCC GCTGCTGTCG
CTGCGCCTCG GCGGGGCCGT CGCGGACCTG TTGGTCGACC CGCTGGCCCG GGCGCTGCCG
ATCACCGACC GGGACGCCGC CGAGATGGTG CGGGGTATCC GCGGCGCGGT GCTGCTGGTC
GGCGGCGCCG GCACCCCGGC GGCGGACACG GCCGCCCTGG AGGACGTGCT GCACCGGCTG
GCCCGCCTCG CCGAGGAGGT GCCGGCGGTC GCCGAGGTGC TCCTGGATCC GGTGCTCGTC
GGCCGGCCCG GCGTGGTCCT ACTGCATGCC GGCGTCCGCC TGCTCCCGCC GGGAACCGAT
CCCGAGTCAC TGCCCCGGCG GATGACGGGC TCCGGCGTCG AGTACTTCCG CTAG
 
Protein sequence
MCRRPARRTS PRGHPSAATW SAPLCNPMPD GYRHGVERRA RGGRRRRRGP ARNSVHGVTS 
RYPAHWAADG VLADGAPVQL RPVLGTDAPG LAELRSGLSA ADVARLPARW ARRSPEELAG
HLTAAGDPAA AGHPTITGHP TTGDSGRLAV AAVLRGRLVG TADYERIAGS DDAVVALVVE
AAHRGRGLGL LLLEHLIAAA RERGVSHLVA DLRAGDDRAL RVFHAAGFAG AETRPRTPRQ
DGAGQDGAGG VRVVFPTAQT PRTRGISRAL EQRAEARSIA RLLTPRAVAV VGASRQPGSA
GHEVFRRLLA SDFHGPVYPV NPAARQVASV YAYPDVREIP DAVDLAVIAV PAPAVADAVR
ACAEKDIRGL IVVSAGFAEA GPDGRARLAE VTRLARESGM RLIGPNAMGV INTDPAVRLH
ATFAAGDPPV GRVGAFTQSG ALAGTFLTEA SRRAIGLSTF VSTGDRSDVS ANDVLQYWQS
DPHTDVIMLH LQGFGNPRKF ARIARRVGRR KPVIALKSGR SAADPALDAL FTSAGVIRVD
TLSQLFDLAA LLASQPLPAG RRVGVVGTSS ALAALATDAC RTAGLEVPPF STATAEALSD
TLGRPEPANP VDLGAMAAPE RFERALRAVA ASADVDAVLA LITPHPAVEE LARAVRAVAG
SGRVPVVASY LGYDGMPSAL AAPGDGIVTP APGSVPSFAS PESAALALAR AAGHAAWRSR
EQGAVPTLDR LDLDRARRAA AAGPTDGTWL PQELVGDILG GVGLAVWPSE PVTSAAQALD
TAERLGWPVA LKIADERFRG RLDVGAVRLG VEGPGALAEA WRTIRAAVGP GDMVVQPMAP
AGVSTVIRMT QDPAIGPLLS LRLGGAVADL LVDPLARALP ITDRDAAEMV RGIRGAVLLV
GGAGTPAADT AALEDVLHRL ARLAEEVPAV AEVLLDPVLV GRPGVVLLHA GVRLLPPGTD
PESLPRRMTG SGVEYFR