Gene Francci3_1161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1161 
Symbol 
ID3903589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1383816 
End bp1385063 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content70% 
IMG OID637878493 
Producthypothetical protein 
Protein accessionYP_480269 
Protein GI86739869 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACGGG TGGCGGGGCT CGGTCGCGTC CCCGGGCGCC CTCACCGGGC TCGCCGGGCC 
TCAGCCGTTC CGGCCGGCGG TGGGTCGTCC ACCGCGCCGA TGCCCACCGC GCCGATGCCC
ACCGCGCCGA TGCCCGAGGC GGCGGCATCC GAGCCACCAG GCGGGCCGCG GGCCGCCGGC
GGGGGGGAGT CCCCGGCTCG GCGGGCCGAC TATCACATCC CGGTCGCGCT GCGGGTCTCG
GCGGGCTGGT CGTGGCGTCT GATCGTCACG GGCGCGGCCA TCTACATCCT GCTGATGGTC
GTGGGACGGG TCCGGATCGT GGTGATTCCG CTCATCGCCG GGCTGCTCAT CGCCGCTCTC
ATCCATCCGC TCGCGCACCG GCTGCAGCGG CTCGGTATGA ACCGGCTGGG CGCTGCCTTC
ACCGCGTTGT TCGTGTTCTT CGCGGTGTTC GTGGGTGCCG GGGTGGCGGT GGGCTTCAAC
GCAGCGAACG AGATCCCCAC GGTCAGCGAT CAGGTCAGCG AGGGCGTCGA GCAGATCCGC
GGCTATCTGA GAAACGGTCC GCTCCACCTG TCACAGAGCC AGATCGACAA TCTGGTCGAC
GACATCCGCA AGAGCCTGGC GAACAACCGG GGGCGCCTGG TCTCCGGGGT GATCTCCGGC
GCCTCGGTGG CCGCCGAGGT GCTCACCGGC CTGCTCGTCA CCCTGTTCTC GACCTTCTTC
TTCCTGTACG ACGGGGATCG TATCTGGAAC TGGATCGTCA CCCGCTTCCC GGACGGAGCG
GAGGAGCGGG TGCGTGGCGC CGGTCGGGAG GCGTGGAACA CGTTGACCGG CTACATTCGT
GGGACGGTCT TTGTCGCGGC GGTCGACGCG ATCGGTATCA CCATCGGCCT GGTCGGCGTG
GGTGTCCCGC TCGTCGCGCC GCTGGCCCTG CTGACGTTCT TCGGCGGGTT CGTCCCGATT
GTCGGGGCGA CGGTGGCCGG CGCCGCGGCG GTACTGGTGA CCCTGGTCTC GAATGGCGTG
CCGGATGCGT TGATCATTCT GGGTGTCGTG TTGGCCGTGC AGCAGGTCGA GGGGCACCTG
TTGCAGCCGC TGGTCATGCG GCGGGCGGTG CGGTTGCATC CGCTCGCGAT TGTCATCGCG
CTGTCGGCCG GTGGCGTGTT GGCCGGCATC CCCGGTGCCA TCGCCGCCGT TCCGTTCGCT
GCCATGGTGA ACAGGGTCGC CGGTTATCTC GCCGTGAGCG GGAAGTGA
 
Protein sequence
MRRVAGLGRV PGRPHRARRA SAVPAGGGSS TAPMPTAPMP TAPMPEAAAS EPPGGPRAAG 
GGESPARRAD YHIPVALRVS AGWSWRLIVT GAAIYILLMV VGRVRIVVIP LIAGLLIAAL
IHPLAHRLQR LGMNRLGAAF TALFVFFAVF VGAGVAVGFN AANEIPTVSD QVSEGVEQIR
GYLRNGPLHL SQSQIDNLVD DIRKSLANNR GRLVSGVISG ASVAAEVLTG LLVTLFSTFF
FLYDGDRIWN WIVTRFPDGA EERVRGAGRE AWNTLTGYIR GTVFVAAVDA IGITIGLVGV
GVPLVAPLAL LTFFGGFVPI VGATVAGAAA VLVTLVSNGV PDALIILGVV LAVQQVEGHL
LQPLVMRRAV RLHPLAIVIA LSAGGVLAGI PGAIAAVPFA AMVNRVAGYL AVSGK