Gene Francci3_1300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1300 
Symbol 
ID3904349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1554173 
End bp1556215 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content73% 
IMG OID637878633 
Productheparinase II/III-like 
Protein accessionYP_480406 
Protein GI86740006 
COG category[S] Function unknown 
COG ID[COG5360] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCGT CGGTTCCGCT CGGTCGCTAC CTGCGCACCA CCGCGGGACT GCGACCGGTC 
CAGCTCGGTG CCCGTGTCCG GCTGCGCGGG CAACGGGCCG TCCTGAGTCG CCACCCGGGC
CTGGGCGAGA TGCTGCTGCG CGGCAGGCCG CCAGCGGAAT CCTGGCCGAT CGGGTTCCTG
CCGTTCGACG GCAGATGCCC GCCGGCTCGG CCGATGTTCG ACGAGATCGT CGCGGGCCGC
CTAACCCTGC TCGGTCACTC CCGCGATCTG TGTCCCCGGG ACTCCAACAC TCCCGCTACT
GGAACCCCCG TACCCCCCAC CCCCACGACC GGCACCCCCG CGGGGGCCGG CGCCGCCGCG
GCCCCCGCCC GATGGGACTG GAGGCAGGCC GACGCTCCGC TGCTGTGGCG CTACCATCTG
CACTACTGGG ACTGGGCATG GGCATTCACC ACCGAGGCTG TCCAGGGGCC CGCGATGTTC
GCCCGGCTGT ATCTGGCCTG GCGGACCTCG GTGGCCCTGG GCGATCCGGT TGCCTGGTCG
CCCTACGTCG TGTCACTCCG GGCCTGGACG CTGTGTGCCT TGTGGCCCCG GCTCGCGCGG
GGAACGCCCG CCGAGATGGC GGTCCTCGCC GACCTCGGCG TCTGCCGGTC CTTCCTGCGC
ACTCATCTGG AGACCGACGT CGGCGGGAAC CACCTGCTGA AGAACTACAA GGCGCTGATC
GGGCTGGCGG TCGCCGATGA CGACGCCCGC GGCCGCCGAC GATGGGTCGA CGCGCTGCTG
CGCGAGCTCG ATCGGCAGGT CCTCAGCGAC GGTGGACACT ACGAGCGCTC CCCCACCTAT
CACTGTCAGG TGCTGGCCGA CCTCGACGAC GTCGCCGGGC TGCTCACCGC GGCGGGCCAC
GTGGTGTCCG GCGCCCTGCT CGACGCGGCC GGCAGGATGC GGTCCTGGCT GACGGCCGTA
CTCGGACCGG ACGGCGTGGT TCCCACGCTC AACGACGGGT TCGCGGTGCC CGCCGAGGCC
CTGCGGCTGC TTCTCCCGGC TCCGGTAAGA CCGACGCCGA TCGCAGTCCA GGGTCCGGGT
CCCGTTCCGA TCCCGGCACC GCGCCTGGCC GAGATGCCCA CCTCGACGTC GCAGTCGACC
GTGGGGACCG TGCCCCGTCC GCGGACGCCC GCATCTGACC GGGCCGACGC CCTTCTGCTG
GTCGACAGCG GTCTCGCCGT GCTGACGGCT GGACCGTGGC ATCTGCTCGC CGACGTGGGC
CTGCCCTGTC CCGAGGACCT TCCGGCCCAC GCCCACGCCG ACACCCTCGC GTTTCTGCTC
TGGCACGACG GCCGGCCGCT CCTGGTGGAT ACCGGGACCT CGACCTACGC CCCCGGACCG
GATCGGGACG CCGAACGCGG TACCGCCGCC CACTCGACCG TCATCGTCGA TCACGCCGAC
TCGACCGAGG TCTGGGGCGC GTTCCGGGCG GGCCGCCGGG CCCGTCCCAC TCTTGTCACG
ATGTGCCACC ACGAAAACGT GGCGACGTTG GCCGCCGGCC ATGACGGCTA CCGCCACCTT
CCCGGGCGAC CCGTGCACTG GCGGACGTGG CGACTCGACC CGAGTGGGTT GTCCGTCGAC
GATCGGATCA CCGGGGAGGG CCGCCACCAC GTCGAGGTGC TGTTCCACTT CGCCCCCGGG
GTGTCCGTCA CCGCCGCGGC GGCCGATACC AGGCGAACCA GTACCACCTC GATCGGGACC
ACCTCGATCG GGACCAGCCG GAACGCGGCC ACCCGGAGCG GGGCCACGGA TGCGCTCACG
GTGACCACCT CCCAGGGGCG GCTGGTCCTG CGGGCCGGCG GACCGGGGCG GTGGGTGGTG
CGCTCCACCC GCCGGGCGGT CGGATGGAGC CGTACGGTGC CGGCGTACAC CGCCGCGTAC
GTGATCGATG CCGAGCTGCC TGTCGCGGTA CGCACCACGG TCGTCCACGA AGTCCGCTCC
GGCGGCCCGC GCTCCGGCAA CCTGCGCTCC GGCGACACGG AATCGTCGCC CGCCCGCCTC
TGA
 
Protein sequence
MTASVPLGRY LRTTAGLRPV QLGARVRLRG QRAVLSRHPG LGEMLLRGRP PAESWPIGFL 
PFDGRCPPAR PMFDEIVAGR LTLLGHSRDL CPRDSNTPAT GTPVPPTPTT GTPAGAGAAA
APARWDWRQA DAPLLWRYHL HYWDWAWAFT TEAVQGPAMF ARLYLAWRTS VALGDPVAWS
PYVVSLRAWT LCALWPRLAR GTPAEMAVLA DLGVCRSFLR THLETDVGGN HLLKNYKALI
GLAVADDDAR GRRRWVDALL RELDRQVLSD GGHYERSPTY HCQVLADLDD VAGLLTAAGH
VVSGALLDAA GRMRSWLTAV LGPDGVVPTL NDGFAVPAEA LRLLLPAPVR PTPIAVQGPG
PVPIPAPRLA EMPTSTSQST VGTVPRPRTP ASDRADALLL VDSGLAVLTA GPWHLLADVG
LPCPEDLPAH AHADTLAFLL WHDGRPLLVD TGTSTYAPGP DRDAERGTAA HSTVIVDHAD
STEVWGAFRA GRRARPTLVT MCHHENVATL AAGHDGYRHL PGRPVHWRTW RLDPSGLSVD
DRITGEGRHH VEVLFHFAPG VSVTAAAADT RRTSTTSIGT TSIGTSRNAA TRSGATDALT
VTTSQGRLVL RAGGPGRWVV RSTRRAVGWS RTVPAYTAAY VIDAELPVAV RTTVVHEVRS
GGPRSGNLRS GDTESSPARL