Gene Franean1_0167 details


Gene Information

Locus tag: Franean1_0167
Symbol:
ID: 5668592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 200636
End bp: 201874
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 78%
IMG OID: 641239096
Product: GCN5-related N-acetyltransferase
Protein accession: YP_001504540
Protein GI: 158312032
COG category:
COG ID:
TIGRFAM ID:
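
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS span includes the stop codon. Both assumptions are consistent with the numbers listed here, but the record itself does not state them. The function names are hypothetical.

```python
# Values copied from the record (Start bp, End bp).
START_BP = 200636
END_BP = 201874


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS span ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1239, matching "Gene Length"
print(protein_length(length_bp))  # 412, matching "Protein Length"
```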


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.10739
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.156031
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACTCG GAGTCGCCGT GCCGCCGCTC ATCGTGGCGC CGGGGCGTGC CGGCGGGCCG 
ATGATGCCGT CCGCGTCGGC GGTGGTCCCG TTCGGGGGAG CCGGCTCGGT AGCCTCATCC
TCGTGGTCGA AGTGTCCGCG AGGTGGCCCC GAGGTGTGCC GGCGTGCCCG TCCGGTGTCT
CCGGCCACCC GGAAACGAGA ACCCGTCGCG CGGCGGGCGC GCCGGGGTGG CTGCGCGTCC
GGGGGCCCCG TGACCGCTGA TCCGGCGAGT GTCGCGGGTG GCGATGCGCT GGTCGCGCCG
AAGCGCGTGC TGCCCCCGGA GGACGTCGAG CTGGCCCGCG AGCTTGAGGA GGTCGCACAC
CGCGCCTGGC CGCCGCTGCG CGAGTGGACC CACGGCGGGT GGGTGCTGCG GGAGTCCGCG
GGCTCCTCCC GCCGGGGTAA CTCGGTGTGG GCCCGTGGCG ACGTCCCCGA CCTGGCCGCG
GCGCTGCGCG CGGTCCACTC CTTCTATACG GCGGCCGGCC TGCCGCCGAC GTTCCAGATC
ACCCCGGTGG CCCGGCCCGC CGGCCTGCTC GACGCGCTGG ATGCCGCGGG TTACGACGAC
GGCGGCCCGA CCGACGTCTG CGTCGCTCCC CTGGCCGCCC TGCGCGCGCC CCGACCGGAC
GCCGCGGAGC ACCGCCCCGG GCCGGCGGGC GGGCCGGGCG CTGACCGCCG CGTCGAGTCC
GCCGCCGCCG ACCTGCCCGA CGAGCGCTGG CTCGCGGTGG CGGGGGACGT CCTGGCCACG
TTCGCCGGCC AGCGCGTCGG CACGCTCGCG GTGGTTCGCG CGATGGCGCT GCCCCAGCGC
TACGTGACGG TCTTCGTCGA CGGCCGTCCG GTCGGCGTCG GGCGCGGGGT CCTGGACGGC
AGCTGGCTCG GGATCTACAG CATGGCCACC CTCCCGGCCG CCCGCGGCGT GGGCGTCGCC
GGCCGCACGC TCGCCGAGCT CGCGCACTGG GCCGGGGCGC GGGGGGCCGA GCGCGCCTAC
CTCCAGGTCG AGCGGCACAG CGTGGTGGCG CGCGGGCTCT ACGCCCGGCG CGGGTTCCGT
CCCGTCTACG GGTACAGCTA CCGGCGGCTG CCGGCACCGT CCGGGCTCCA GCGCAGCGCC
GTGGCGCCCC GCGTGGCGAC GGAAAACGTG GCATCGGAAA ACGTGGCATC GGCGTGGCCT
GCGGCCGGCC GAGCTGGCGG CGGGACGGCC AGCCGATGA
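
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any computation. As a minimal sketch, the hypothetical helper below strips the whitespace and computes GC content as the percentage of G and C bases. It is shown on the first block only; applying it to the full sequence pasted from this record would allow a comparison with the reported GC content. How the database rounds its figure is not stated here.

```python
def gc_content(block_formatted_seq: str) -> float:
    """Percent of G/C bases, ignoring spaces and line breaks."""
    seq = "".join(block_formatted_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten-base block of the gene sequence, used as a small demo.
first_block = "ATGCCACTCG"
print(gc_content(first_block))  # 60.0
```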
 
Protein sequence
MPLGVAVPPL IVAPGRAGGP MMPSASAVVP FGGAGSVASS SWSKCPRGGP EVCRRARPVS 
PATRKREPVA RRARRGGCAS GGPVTADPAS VAGGDALVAP KRVLPPEDVE LARELEEVAH
RAWPPLREWT HGGWVLRESA GSSRRGNSVW ARGDVPDLAA ALRAVHSFYT AAGLPPTFQI
TPVARPAGLL DALDAAGYDD GGPTDVCVAP LAALRAPRPD AAEHRPGPAG GPGADRRVES
AAADLPDERW LAVAGDVLAT FAGQRVGTLA VVRAMALPQR YVTVFVDGRP VGVGRGVLDG
SWLGIYSMAT LPAARGVGVA GRTLAELAHW AGARGAERAY LQVERHSVVA RGLYARRGFR
PVYGYSYRRL PAPSGLQRSA VAPRVATENV ASENVASAWP AAGRAGGGTA SR
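
The protein sequence is the translation of the gene sequence under translation table 11. As a hedged sketch, the code below builds the codon table from the compact NCBI-style amino acid string (table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons) and translates the first 30 bases of the gene. The function name and the choice to stop at the first stop codon are illustrative assumptions, not part of the record.

```python
from itertools import product

# Codons in T, C, A, G order paired with the amino acid string used
# by the standard code; table 11 shares these codon assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block_formatted_seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(block_formatted_seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence.
print(translate("ATGCCACTCG GAGTCGCCGT GCCGCCGCTC"))  # MPLGVAVPPL
```

The output matches the first ten residues of the protein sequence listed above.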