Gene Franean1_4492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4492 
Symbol 
ID5672842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5359224 
End bp5360381 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content71% 
IMG OID641243359 
Producthypothetical protein 
Protein accessionYP_001508775 
Protein GI158316267 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACGC TGTCGCGTGA GGTCGCGATC GTCGGTGTCG GCTACTCGCC CTTCTCCCGT 
CGGGGGCCGG TCGACCCGCG TCGGCTCGCC TACCAGGCAT GCACGGGCGC GCTCGACGAC
GCGGGGCTCG CCGCACGCGA CATCGAGGGC CTCTACCACT ACCGGTTCGA GGACGACATC
CCGGTACACG AGGTCGCCCG GATGCTCGGC ATGGATGACC TCGCGGTGTT CGCCGACATC
ATCCCGACCA GCCCGTCCGG CCTCGTCGCC GTTCTCGAGG CGATCATGGC GGTGGCGTCG
GGCGCGGCCG AGACCGCGAT GGGGTTCCGG TGCCTGACCC GGGAGACCGG CTACGCGGGG
GCGCTGTCGA GTGACACGGC GCCGGTGGGC GGGATCGAGC AGTACCTCGC TCCCTTCGGC
TGGTCGGGCG TTCTGATGGG CATGGGGATG CGGATGCGGC GCCGCATCCA CGAGCTCGGT
GGTTCCCTGG AGGACTACGG CCAGATCGCC CTGAACGCGC GCCGGTGGGC GGCATTGAAC
CCGCAGGCCG TGCTGCGCGA ACCGATGACG ATGGAGGAGT ACCTCGACGG CCGGCTGGTC
GCCGACCCGC TGCGCGTCTA CGACTGTGAC TATCCGGTCA ACGGCGCGGT CGCCTGCATC
GTGACCACCG CCGAGCGCGC ACGTGACCTG CGGCAGCGCC CGGTTCTCGT CGACGCGATG
GCGTACTCGA ACGGCGTCGC GCCGGACACC CGGTGGGCCT TCGGTGAGGA CTTCCTCTTC
GGCTCCGCGC GGCGGTGCGC GGACCGGCTC TGGTCGCGCT CCTCGTTCAC CGCTGCCGAC
GTGGACCTCG CCCAGCTGTA CGACGGGTTC ACCCACGTCA CGTTGTCGTG GGTCGAGGCG
CTGGGTTTCT GCGGGGTCGG GGAGTTCGGG GACTGGGTGG AGGAGGGCAA GCGGATCGGC
CCGGGCGGGG AGCTGCCTGT CAACACCGGT GGCGGTCACC TCGCGGAAGG CCGAGTGCAC
GGCATCCAGC TGCTCACCGA GGCCGTTCTG CAGCTGCGTG GCCAGGCGGG GGAGCGTCAG
GTTCCCGATG CTTCGGTCGC CGTGGTCACG AACGCGTTCG GCGCCCAGAC CGCCGGCATG
GTCGTCAACG TCGAGTGA
 
Protein sequence
MSTLSREVAI VGVGYSPFSR RGPVDPRRLA YQACTGALDD AGLAARDIEG LYHYRFEDDI 
PVHEVARMLG MDDLAVFADI IPTSPSGLVA VLEAIMAVAS GAAETAMGFR CLTRETGYAG
ALSSDTAPVG GIEQYLAPFG WSGVLMGMGM RMRRRIHELG GSLEDYGQIA LNARRWAALN
PQAVLREPMT MEEYLDGRLV ADPLRVYDCD YPVNGAVACI VTTAERARDL RQRPVLVDAM
AYSNGVAPDT RWAFGEDFLF GSARRCADRL WSRSSFTAAD VDLAQLYDGF THVTLSWVEA
LGFCGVGEFG DWVEEGKRIG PGGELPVNTG GGHLAEGRVH GIQLLTEAVL QLRGQAGERQ
VPDASVAVVT NAFGAQTAGM VVNVE