Gene Francci3_1510 details

Gene Information

Locus tag: Francci3_1510
Symbol:
ID: 3904976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1801242
End bp: 1802486
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 71%
IMG OID: 637878847
Product: hypothetical protein
Protein accession: YP_480615
Protein GI: 86740215
COG category:
COG ID:
TIGRFAM ID:
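
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1801242
end_bp = 1802486

# Inclusive coordinates: both end points belong to the gene.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last triplet is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1245
print(protein_length)  # 414
```

Both values agree with the Gene Length and Protein Length fields.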


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAACT ACTTCTTCAT CTCCTATGCG GCAGATGAGG ATCGGCGCTG GGTGCAGCGC 
TTTCACGCCG ATCTCGAGTA CGAGCTGCGG GTTCAGGTCG GCTCCGCGGT GGGCGGAATC
CTGGACACCC GTCCGCGTCC CGGCATGGAC GCCGACCTGG CCCTGGCCGC CGGCGCCGGC
GGAACCCGCG TGATGGTGGC GCTCTGTTCC GATCCCTTTT TCAGGGACGC CTGGTGCGGG
CGCGAGTGGG AGGTCTTCCA CTCGCGCATC GAAAATTTCA GCAAGGCCGG TGCGACCAGA
CTGGAGGACG GTTTCCTCCG GGTGCTGTGG CGTCCGACGC AGGATCCCGT CCCGGCGGTC
GCGACGAGGG ATCTCGCGGA CGCGAGCGCG GGACTCCCCG ACACCTATCG CCAGCGCGGC
CTGCTCTGGA TGATGCGCAA GATGCTGCTC GGGGCGGGCG GGTACTACTG GTTCGTCCGC
ATGCTCGCGG CGCGGATCGT CGCGGCCCAG CAGATCACCC TGGACCCGGT GCCCGACCTC
GCGGTGCGCC GCGCCGCCGC GGCCTTCGGC TCCCACCCCG GCTCGTCCGG GGACCCGTCG
GAAGCGATGC CCCCCTTCCT TCCGCTGGCC GGGTCCGTGA ACGGGAACGG GAACGTGAAC
GGGAACGGGA GAGCCGGTTT CCGGACCCAC GGCGACGGCG ACGAGGCGCG GCGCGCGGCT
TCGATCGGGA CGGCACCCCC CGTGCCCACG CCGACTACCA CCTCCCGCCG AACCACGCCG
CCGGTGGCGT CTGACGTACC GTCGCCGCAG ACGCCGCTGG CGGAGGTCTC CCGTCTGGTC
GCCATCAGCT ACGTCGGCGC CGACCAGGAG TGGGCGGACT GGCTCGAATA CCTGCTGCGC
CGCGGCTCCC ACCGGGTCGT GCAGGTCAGA TGGGCGCAGA GTAGAGGTGA GCGGCTGGCC
GAGACGGTTG ACCGCATCGC CGCGCGCCGG CCGGATGTCA CCGTGGCCCT GCTGTCGCGG
CACTACCGGC CGCCCCGCCC GGAGAACCCC GGCGAGACCG AGATCGAGGC GTGGGAGCGG
CTGGGGCTGG CGGGACCGCT CGAGAATCAG GTCATCCGGG CGGTCATCGA CCGCGAACCG
TTACCCGAGC CGCTGCGGAC GCTGCCGAAG CTGGATCTGA GCCGGTTCGA GCCGACCGCG
GTCGACGGAC TGCTGGCGGG GATACGGGCG GGAGCCCGGC GATGA
 
Protein sequence
MKNYFFISYA ADEDRRWVQR FHADLEYELR VQVGSAVGGI LDTRPRPGMD ADLALAAGAG 
GTRVMVALCS DPFFRDAWCG REWEVFHSRI ENFSKAGATR LEDGFLRVLW RPTQDPVPAV
ATRDLADASA GLPDTYRQRG LLWMMRKMLL GAGGYYWFVR MLAARIVAAQ QITLDPVPDL
AVRRAAAAFG SHPGSSGDPS EAMPPFLPLA GSVNGNGNVN GNGRAGFRTH GDGDEARRAA
SIGTAPPVPT PTTTSRRTTP PVASDVPSPQ TPLAEVSRLV AISYVGADQE WADWLEYLLR
RGSHRVVQVR WAQSRGERLA ETVDRIAARR PDVTVALLSR HYRPPRPENP GETEIEAWER
LGLAGPLENQ VIRAVIDREP LPEPLRTLPK LDLSRFEPTA VDGLLAGIRA GARR
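
The two listings above are laid out in space-separated blocks of ten letters. The following is a minimal sketch of how such a listing could be joined, translated, and checked for GC content. The function names are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted), and it does not handle ambiguous bases.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order ("*" marks a stop).
# Table 11 shares these assignments with the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listing above.
first_line = join_blocks(
    "ATGAAGAACT ACTTCTTCAT CTCCTATGCG GCAGATGAGG ATCGGCGCTG GGTGCAGCGC"
)
print(translate(first_line))  # MKNYFFISYAADEDRRWVQR
```

The printed residues match the first two blocks of the protein listing. Passing the complete gene listing through `join_blocks` would allow the same comparison against the full protein sequence, and `gc_fraction` can be compared with the GC content field of the record.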