Gene Francci3_0648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0648 
Symbol 
ID3902982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp739854 
End bp740924 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content72% 
IMG OID637877981 
ProductPGAP1-like 
Protein accessionYP_479761 
Protein GI86739361 
COG category[R] General function prediction only 
COG ID[COG1075] Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00162479 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.67184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCGCGG ACAACATCGC CGCTCCAGGG CGGTCGGTCC TCACCGGCCT GCGCACGACC 
CCGGTGAACG CCCGCGGGCT CGCCGTGGAA GCGGCCTGGA TCGCCACCCA CCTCGCCCTC
TACCCGGCGA GTGCGCTGCG CCGGCAGCGC CCGCGCGAGC ACGAGCCCTA CTCGCTGAGC
GCGCTCTCCC CGCTGCACCG CAGCCTGCTG GTCAACGCTC CCGACGCGGC AGGCACCCCG
ATCCTGCTGA TCCACGGCCT GATCGACAAC CGGTCCGTGT TCACCCGGTT GGGCCGGTCG
CTACGCCGCC GCGGTTTCCG CCGGGTGCGC ACGGTGGAGC TGCCGCTGCT CGTGCCGACG
GTGCAGGAAG CGGCGCTCCG ACTCGCCGCG TCCGTGCACG CCGCGATGAC GGACAGCGGC
AGGCAGCGCG TGCACATCGT CGCCCATTCA CTCGGCGGGC TGGTGGCCCG CTACTACGTG
CAGCAACTCG GGGGCGATCA GTACGTGGAC ACGCTGATCA CTCTCGCGAC GCCTCATTCC
GGCACCCGTC TCGCCGGGCT CGTCCCCCGG TCGGTGCCGT ACCGGCTCGT CACCCAGCTA
CGGCCGGGAT CGGCGCTCCT ACGCGAGCTC GCCGCACCCG CCCCCGGCTG CCGGACCCGG
TTCGTCGCGA TCGGCGCCGG GCTGGACAGC GTGGTGCGGC CCGCCGAGGC GGCGCTCGAC
CATCCCGATC TCGACATCGA AAATTACACC GTGCCGGGCC TCGGGCATCA TTCCCTGGCG
TTCAGCGGCA AGGTTGCCCA CCTGGTCGCG AGCTGCCTCG CCGGGGCGGC GGACCGGCCG
GGACTCTCGG GACCGTCAGA GCTTCTCGAC GGGGGCGTAC CGCAGGAGCA GGCGCTTCGT
GCCGGTGTTC TCGCCGAAAT CGATGGTCGC CTCCGCGGAG TCACCGACCC CGCCGGTGGC
GACGACAACT CCGAGACCGA AACTGTCATG CGTCACCCGG TCGCCCTGAC GCAACTCCAG
GACGGCCCGA GCGGCCGGCC GCGCGCGACC CGAGAACGGC GAGTTCGGTA G
 
Protein sequence
MAADNIAAPG RSVLTGLRTT PVNARGLAVE AAWIATHLAL YPASALRRQR PREHEPYSLS 
ALSPLHRSLL VNAPDAAGTP ILLIHGLIDN RSVFTRLGRS LRRRGFRRVR TVELPLLVPT
VQEAALRLAA SVHAAMTDSG RQRVHIVAHS LGGLVARYYV QQLGGDQYVD TLITLATPHS
GTRLAGLVPR SVPYRLVTQL RPGSALLREL AAPAPGCRTR FVAIGAGLDS VVRPAEAALD
HPDLDIENYT VPGLGHHSLA FSGKVAHLVA SCLAGAADRP GLSGPSELLD GGVPQEQALR
AGVLAEIDGR LRGVTDPAGG DDNSETETVM RHPVALTQLQ DGPSGRPRAT RERRVR