Gene Franean1_5914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5914 
Symbol 
ID5674235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7184281 
End bp7185936 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content68% 
IMG OID641244762 
Productcarboxyl transferase 
Protein accessionYP_001510164 
Protein GI158317656 
COG category[I] Lipid transport and metabolism 
COG ID[COG4799] Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.744099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.878673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACGGT CTCCACATCA ATCGAGCACG TGGACCACCG ACCGCCGTAC GCTTCGCGGT 
ATGGCTCTCC CGGCTTCCCC GCAAGAACCG ACGCCCGAAC CGCACAGCAC AGCCGGCCGC
CTCGCCCGGC TGCAAGCGGT GACCGACGAG GCCGTCCATG CCGGTTCGCA GCGCGCGGTG
GACGCGCAGC ACGCCAAGGG AAAGATGACG GCCCGGGAGC GGATCGAGGC GTTCCTCGAC
CCCGGCTCGT TCGTGGAGAC GGACGCGTTC GCGCGCCACC GCTCGCACGA CTTCGCGATG
GACCGCAACC GCCCCTACGG CGACGGCGTC GTCACCGGCT ACGGCACGGT GGACGGCCGC
CAGGTCTGCG TCTTCAGCCA GGACTTCACC GTCTTCGGCG GGAGCCTGGG CGAGGTCTTC
GGGGAGAAGA TCGTCAAGAT TATGGATCTC GCGCTCAGGA CGGGAGTGCC GGTCGTCGGT
ATCAACGACT CCGGCGGCGC GCGGATCCAG GAGGGCGTCG TCTCGCTCGG CCTGTACGGG
GAGATCTTCT ACCGCAACGT CATGGCGTCC GGCGTCGTGC CGCAGCTCTC GCTGATCATG
GGCCCGTGTG CGGGCGGCGC GGTGTACTCA CCCGCGATCA CCGACTTCAC GCTCATGGTC
GACCAGACCA GTCACATGTT CATCACCGGC CCGGACGTGA TCAAAGAGGT CACCGGCGAG
GACGTCGGCA TGGAGGAGCT GGGTGGCGCC ACCACCCACA ACTCCCGCAG CGGCGTCGCG
CACTTCCAGG CCGCGGACGA GGCCGACTGC CTGGAGCTGG CCCGCGCCCT GCTGTCCTAC
CTGCCGTCGA ACAACATGGA CGAGGTGCCG GCCTACACCG AGGTCGCCGA CGTCGAGGCC
GAGGTCGACC CGGAGCTCGA CACGTTCATC CCGGATTCGC CGAACACCCC GTACGACATG
CACCATGTGA TCGAGCGGAT CCTGGACGAG GAGGACTTCC TCGAAGTCCA CGCCCAGTTC
GCACAGAACA TGATCGTCGG CTTCGGGCGC ATCGACGGCC GCTCGGTCGG CGTCGTCGCC
AACCAGCCGA TGCAGTTCGC CGGCACCCTG GACATCGACG CCAGCGAGAA GGCCGCGCGC
TTCGTGCGCA CCTGCGACGC GTTCAACATC CCGGTGCTGA CGTTCGTGGA CGTTCCCGGC
TTCCTGCCGG GAACGTCGCA GGAATGGAAC GGCATCATCC GGCGCGGCGC CAAGCTCATC
TACGCCTACG CCGAGGCGAC CGTTCCCAAG GTGACGGTCA TCACACGGAA GGCCTATGGC
GGCGCCTATG ACGTCATGGG TTCGAAGCAT CTGCGCGCCG ACATCAACCT GGCGTGGCCG
ACGGCCGAGA TCGCCGTGAT GGGTGCCCGC GGCGCGGTGA ATATCATCTA TCGCCGCGAG
CTCGCCGGCT CGGAAGCGCC CGAGCAGCGC CGCAGCGAAC TGATCCAGGA CTACACCGAT
CATTTCGCGA CGCCGTACAT CGCCGCGGAG CGCGGATACC TGGACGCCGT CATTCCGCCG
TCGGTGACTC GGCGCGAGGT CATCCGGGCG CTGCGCCTGT TGCGCACCAA GCGCGCGACG
CTCCCGCCGA AGAAGCACGG CAACATCCCG CTGTAG
 
Protein sequence
MRRSPHQSST WTTDRRTLRG MALPASPQEP TPEPHSTAGR LARLQAVTDE AVHAGSQRAV 
DAQHAKGKMT ARERIEAFLD PGSFVETDAF ARHRSHDFAM DRNRPYGDGV VTGYGTVDGR
QVCVFSQDFT VFGGSLGEVF GEKIVKIMDL ALRTGVPVVG INDSGGARIQ EGVVSLGLYG
EIFYRNVMAS GVVPQLSLIM GPCAGGAVYS PAITDFTLMV DQTSHMFITG PDVIKEVTGE
DVGMEELGGA TTHNSRSGVA HFQAADEADC LELARALLSY LPSNNMDEVP AYTEVADVEA
EVDPELDTFI PDSPNTPYDM HHVIERILDE EDFLEVHAQF AQNMIVGFGR IDGRSVGVVA
NQPMQFAGTL DIDASEKAAR FVRTCDAFNI PVLTFVDVPG FLPGTSQEWN GIIRRGAKLI
YAYAEATVPK VTVITRKAYG GAYDVMGSKH LRADINLAWP TAEIAVMGAR GAVNIIYRRE
LAGSEAPEQR RSELIQDYTD HFATPYIAAE RGYLDAVIPP SVTRREVIRA LRLLRTKRAT
LPPKKHGNIP L