Gene Francci3_3323 details

Gene Information

Locus tag: Francci3_3323
Symbol:
ID: 3904109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3937627
End bp: 3939327
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 76%
IMG OID: 637880648
Product: hypothetical protein
Protein accession: YP_482409
Protein GI: 86742009
COG category:
COG ID:
TIGRFAM ID:
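
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3937627, 3939327)
print(length)                     # 1701
print(protein_length_aa(length))  # 566
```

Both values agree with the Gene Length and Protein Length fields of the record.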


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.34323
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.797481
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGAC CGGTCTACCA TCTCGCCCCG CGCCGAGCCG GGACCGCGCT GTTCGGCGCC 
AGCGTCCCGC AGGTCATCCT GATTGGCCTC GGTGTCGGCG GGCTCGCGGC GGGCCCGAGG
CTGCTCGGCG GCGGTTCCGG CACGACGGCC GGGGTTGGTG TGGCGGTGGC CTGCCTGCTG
TTCGCGTTCG TCCGGGTCGG CGGGGAACCC CTGGTCCATC TCCTGCCGGT CGTCGCCGGC
TACCTGCTGC ACACCCGCCT GCTGCACACC CGCCTGCTGC ACACCCGCCT GCTGCACACC
CGCCTGCTGC ACACCCGCCT CCTGCACACC TGCGGCGGAT GCCGGCCGTG CGCCGTCCCC
TCCTCCGCTC GGCCGACGGG CATCGGGGCG GGGGCGGCCG GGTGGGGATC CGCCGGCCGC
CGGTCCGAAC GGGTGGATCT TCCCGCGGTG CCGCGGTCGG TCGAGGTGGT AGCGGCGGCG
ACCGGTTGCC GGGTACCAAC CGCAGACGGG CAACCGGCCG GTCTGGTTCG TGACCGGCGC
ACCGGGACGA TCACCGTGGT CCTCGACGTG CGCGGTGGCC CGTTCGGACT GCTCGACGGC
GCCGGGAAGG ACCGCCAGAC CGCCGGATGG GCCCGGGTGC TGACCCAGTT CGCCCGGGAA
ACCCCGGTGG CCCGGCTCGG CTGGACGGTC CGCTCCGGAC CGGCGACCGC CCTGGACCTC
CCCGTCGAAC CCCGCCGCCA GCCGGAATCC GCGGCGGCGG CCCGGTCGCG GCAGCCGGCG
CGCCCGCCGG CGGGGGAGCT GCTGGCCTAC CGGCGGCTCC TCGCCGAGGC ACAGCCCGCC
CTGATCCGCC ACGACCTGCG GCTATGGCTG ACCGTCCGCC CGACACGCGG AGGCCGCCAC
GCCGACGGCC GGGCCACCGC GCTGGCCGCC GCCGAAACCC TGGCCGACCG ATGCGCCAGC
GCCGGCCTGC ACGTCCGCGG CCTGCTGTCC ACCGCCGAGC TCACCAAGAC CGTGCTCGAC
CACGCCGACC CGCCGCCTCC CGAAGCCTCG AAGGCCCTCG AAGCCCCCAG TCGTGCGGCG
GAGCCGGACA GCGCATCGAC TCCGGGCCTG GCGGCCCGCG CCCATCTCCC GGGAGCCGGC
ACACCGCGAC CGCCGCAGCG CCTCCAGCCG GACAGCCTCA CGCTGCGGGC GTGGTGGGAC
GCGGCCCGGA TCGGCGACAG CTGGCACCGG GTGTTCTGGA TCGCAGGCTG GCCCACCGGC
GGACTGCGCC CGGGCTGGCT CGACCCCCTG CTCCATGACG TTCCCTGTGT CCGCACCCTC
GCGCTCACAA TGACCCCGGT GCCGTGGCGG GTCTCCCGCC GGCGCATCAA CAGCGACACC
GTCTCCGTCG ACACCGCCGT CCACCTCCGC GACCGGCATG CCTTCCGCGT CCCCGTCCAC
CTCACCCAGG CCCACGACGA CATCGACCGC CGCGACGCCG AACTCACCGC CGGCTACCCC
GAATACGCCT ACCTCGGCCT CCTCGACGTC ACCGCCCCCA GCCGGCACGA CCTCGACGAC
GCGTCCGCCG CGATCGTCGA CCTGGCCGCC CGCTGCGGCA TCGTCGACCT GCGCCCCCTG
CACGGCCGGC ATCACACCGC CTGGGCCGCC ACCCTCCCCC TCGGCCTCGC ACCCCGGCCG
ACCGTCACCG GAGCACCCTG A
 
Protein sequence
MSGPVYHLAP RRAGTALFGA SVPQVILIGL GVGGLAAGPR LLGGGSGTTA GVGVAVACLL 
FAFVRVGGEP LVHLLPVVAG YLLHTRLLHT RLLHTRLLHT RLLHTRLLHT CGGCRPCAVP
SSARPTGIGA GAAGWGSAGR RSERVDLPAV PRSVEVVAAA TGCRVPTADG QPAGLVRDRR
TGTITVVLDV RGGPFGLLDG AGKDRQTAGW ARVLTQFARE TPVARLGWTV RSGPATALDL
PVEPRRQPES AAAARSRQPA RPPAGELLAY RRLLAEAQPA LIRHDLRLWL TVRPTRGGRH
ADGRATALAA AETLADRCAS AGLHVRGLLS TAELTKTVLD HADPPPPEAS KALEAPSRAA
EPDSASTPGL AARAHLPGAG TPRPPQRLQP DSLTLRAWWD AARIGDSWHR VFWIAGWPTG
GLRPGWLDPL LHDVPCVRTL ALTMTPVPWR VSRRRINSDT VSVDTAVHLR DRHAFRVPVH
LTQAHDDIDR RDAELTAGYP EYAYLGLLDV TAPSRHDLDD ASAAIVDLAA RCGIVDLRPL
HGRHHTAWAA TLPLGLAPRP TVTGAP
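
The gene and protein sequences above are laid out in space-separated blocks of ten characters. The following is a minimal sketch of how such a block could be cleaned, translated, and checked for GC content. It assumes the sequence is shown on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in permitted start codons). The helper names are hypothetical.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codon -> amino acid map, built in the conventional TCAG ordering.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGAGCGGAC CGGTCTACCA TCTCGCCCCG"))  # MSGPVYHLAP
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the GC content field.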