Gene Franean1_6662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6662 
Symbol 
ID5674977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8091387 
End bp8092847 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content77% 
IMG OID641245513 
ProductGCN5-related N-acetyltransferase 
Protein accessionYP_001510905 
Protein GI158318397 
COG category[R] General function prediction only 
COG ID[COG2144] Selenophosphate synthetase-related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0223324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.900242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGTC TGCTGTCGGG GGGGCACACG CCGATCCGGC ACCGCACCGT GGACACACTG 
GCGATCTGGG GTGATCGCCG GACGCTGGCG GGCCAGCCGC CGTACCGCGT CGAGCAGGCC
GAGGATCTGG CCACCTTCAC CGCCTACGCC CGTCTGCGGC GGGAGGTCTT CGTCGACGAG
CAGGGCCTGT TCTCCGCGAC CGTGGCCGGC GATCTGGACG AGGTCGACAG CGACCCGCGC
AGCATCGTCC TGGTCGCCCG GGTCGTCGGC GGGCCGGATG ACGGCACGGT GATCGGCGGG
GTGCGGTTGG CGCCGATCTG GCGCGGCGAG GACATCGGCG CCTGGCAGGG CGGCCGGCTC
GTCGTCGCGG CAGCCGCCCG CGGGCGCTAC GCGGGGATCG GTGCCGCACT GGTCCGGGCG
GCGTGCGCGC GCGCCGAGAA CGAGGGGGTG CTGCGCTTCG ACGCCGCGGT GCAGCCCGAC
CGCGCCCGCT TCTTCGGCCG GCTCGGCTGG ATGATCGCCG GGACGACCAC GGTCGCCGGC
CGCCCGCACG TGCTCATGCG CTGGCCCATC AACCGGCTGG CGGCGGTGGC GGCCTCGATC
AAGGCCCCGC TGGCGACCCT GCTGGCCGGG ATGCGCCCCG GCGGCCCCGG GTTCGTCGGT
GACGACGGGG CACCGGTGCC CGGCACGGAC GTCGTCGCGG CCTGCGACGC GATCGTGCCG
TCGATGGTCG AACGCGACCC CTACTGGGCC GGCTGGTGCG GCGTGCTGGT CAACCTCAAC
GATCTGGCGG CGATGGGCGC CCGGCCGGTG GGCATGCTCG ACGCGGTGGC CGGCCCGACG
GCGTCCCGGG TCGCCCGGGT GATCGGCGGG CTGCGGGCGG CGGCGGAGCG CTACGGCGTG
CCCATCCTGG GCGGCCACAC CCAGCTCGGG GTGGCGGCCG CGCTGTCGGT GACGGCGCTG
GGCCGCTCCG AACGCCCGAT CCCCGCCGGC GGCGGCCTGC CGGGGCACGC GGTGACCCTG
ACCGCCGACC TGGGCGGTGA CTGGCGCCCG GGGTACTCCG GCCGGCAGTG GGACTCGACG
TCCAACCGCC GGACGGCGGA GCTGCGCGCG CTGCTGGACC TGCCGCGCCG GCACCGCCCG
TGCGCGGCGA AGGACGTCAG CATGGTCGGG ATCGTCGGCA CGCTCGGCAT GCTCGCCGAG
GCGAGCGGGT GCGCCGCCGA GCTGGACGTG GCCGCGGTCC CCCGGCCGGC GGGCGCCACC
GTCGGCGACT GGCTCACCTG CTTCCCCGGT TACGCAATGC TCACCGCCGA CGTCGACGAC
CGCCCCGTTC CCGCGCCGAG CCCGGCGACG TCGCAGCGCT GCGGCCGGCT ACTCAACGGG
ACGGGAGTGA CCCTGCGCTG GCCGGACGGT GTGCTGACCC CGGCGCTGTC CGGCCATGTG
ACGGGGCTGG GCCACGCGTG A
 
Protein sequence
MSSLLSGGHT PIRHRTVDTL AIWGDRRTLA GQPPYRVEQA EDLATFTAYA RLRREVFVDE 
QGLFSATVAG DLDEVDSDPR SIVLVARVVG GPDDGTVIGG VRLAPIWRGE DIGAWQGGRL
VVAAAARGRY AGIGAALVRA ACARAENEGV LRFDAAVQPD RARFFGRLGW MIAGTTTVAG
RPHVLMRWPI NRLAAVAASI KAPLATLLAG MRPGGPGFVG DDGAPVPGTD VVAACDAIVP
SMVERDPYWA GWCGVLVNLN DLAAMGARPV GMLDAVAGPT ASRVARVIGG LRAAAERYGV
PILGGHTQLG VAAALSVTAL GRSERPIPAG GGLPGHAVTL TADLGGDWRP GYSGRQWDST
SNRRTAELRA LLDLPRRHRP CAAKDVSMVG IVGTLGMLAE ASGCAAELDV AAVPRPAGAT
VGDWLTCFPG YAMLTADVDD RPVPAPSPAT SQRCGRLLNG TGVTLRWPDG VLTPALSGHV
TGLGHA