Gene Franean1_1167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1167 
Symbol 
ID5669580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1389709 
End bp1391124 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content77% 
IMG OID641240099 
Producthypothetical protein 
Protein accessionYP_001505527 
Protein GI158313019 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3509] Poly(3-hydroxybutyrate) depolymerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.428408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.287821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCACCG GGAACTCCGC CGGTCGTCGG CGTAGCCACG GCCGGGCCGG GGCCGGCTCC 
GCGCGGTTGG GGGCCGCCCG GCAGGGGCTG GCCCGGTCGA GCGCGCTGCG GCACGGGTCG
CGTCCGGCGA ACACACTGTG GCTGCCGCTC ACCGCGGCGG CGGGGGTGCT CCTGCTCCTG
GTCTGGTACG CGATGTCGCA GGATCCCGGG GAGGAGACCG GTTCCTCGTC CCCGGTCGCG
TCCGCGCTGG GCGGAGCGGG CGGTGCCTCC GGCGGGCCCG CCGCCGTGGC GGCACCGACC
GTCCCGCCGC CACCGCGCAG CGGCTGCGAG ACCGGCAGCA GCCTGCCGGC GGGCGTCACC
ACCGAGACCA TCCACATCGG CGACGTCGAC CGGCAGTTCC TGCTGGCCGT CCCGCAGGGC
GGCGGGGCGG GTACCGCACT GCCGCTGGTG CTGAACCTGC ACGGTTACGG GCAGCCCGCA
CAGGAGCTCG AGCAGTACAC CCAGATGGCG GACAGCGGCA CCCGGGCCGG CTACGCCGTC
GCCACCCCGG TCGGGGCGTC GAACCGGTGG AACTTCACCC GCTCGGCCAA GGTGGGCCCG
GATGACGTTC TCTTCGTCGG CGCGCTGCTC AACGACCTGA CCGGCCGGAT GTGCCTAGAC
CCGAAGCGGG TGTTCGCCAG CGGCTTCTCC GACGGCGCCG ACATGGTCCT CGCGCTGGCC
TGCGCGCTGC CCGAGCGCTT CGCGGCGGTG GTCACGGTCG CGTCGAGCCG CTTGCCGGAT
GCCTGCGCCG CGCCACGGGC CAGCCTGCTC GAGCTGCACG GCACCGACGA TCGGATCGCG
CCGTTCAACG GCGGGGGCCA GGCCCGCACC GCCCCCTACA ACGGGCTCGT CGCCCAACCG
GTGGAGGCCC GCACCGACAG GTACGCGCGC GCCCTCGGCT GCGGCGGCGA GCACTCGACG
GCGCAGGACT TCACCCACGT CGCGCGGACG ACCTGGACGA CCTGCCCGGC CGGCAAGGAC
GTCGGCGCCC TGGCGATGCA GGGCGGCGGG CACACCTGGC CGGGCGCGGC ACCACGGCCC
GAGCTGGGTG GCACGGTGAC AAACCTGAGC GCGACCGTCG TGGCGCTGGC CTTCTTCAGC
GGCCACCCTG CCGACGGCCG GACGGTGACC GGCAGCGGCA GCTCCGGAGG CGGCGCGCCC
GCGACGGGCG GCGCGGCCAC GGGCGGCGCC ACCGCGCCGA CCCAGCCGCC GGCGGGGGCA
CCCGCGACCA CCCCGCCCGC CTCGGCGGCA CCCGGTGACC AGCCCGAGCC CACACCGACC
GGGGCGAGCG ACGATCCCGA CCCGGCGCCG TCGGCGCCGC CCGGCCCGTC CACGAGCGCG
ACGCCGACCG GTGAGGCGCC CGCGGGCGCC GGGTAG
 
Protein sequence
MATGNSAGRR RSHGRAGAGS ARLGAARQGL ARSSALRHGS RPANTLWLPL TAAAGVLLLL 
VWYAMSQDPG EETGSSSPVA SALGGAGGAS GGPAAVAAPT VPPPPRSGCE TGSSLPAGVT
TETIHIGDVD RQFLLAVPQG GGAGTALPLV LNLHGYGQPA QELEQYTQMA DSGTRAGYAV
ATPVGASNRW NFTRSAKVGP DDVLFVGALL NDLTGRMCLD PKRVFASGFS DGADMVLALA
CALPERFAAV VTVASSRLPD ACAAPRASLL ELHGTDDRIA PFNGGGQART APYNGLVAQP
VEARTDRYAR ALGCGGEHST AQDFTHVART TWTTCPAGKD VGALAMQGGG HTWPGAAPRP
ELGGTVTNLS ATVVALAFFS GHPADGRTVT GSGSSGGGAP ATGGAATGGA TAPTQPPAGA
PATTPPASAA PGDQPEPTPT GASDDPDPAP SAPPGPSTSA TPTGEAPAGA G