Gene Information |
Locus tag | Francci3_3995 |
Symbol | |
ID | 3906956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4779225 |
End bp | 4780832 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881324 |
Product | hypothetical protein |
Protein accession | YP_483074 |
Protein GI | 86742674 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCGGTG GCGTCATGAG CAAGCTCAGC CGGCGGGTGG AGCAGGAGGA GTTGCGGGCC CGGATGCGCG CGGTGGGTAT GTCCCACGAC GAGATCGCTG TCGAGTTCGC CCGCCGCTAC AAGTTGCGGC CCCGTGCCGC CCACCGCCAC GCCCGCGGCT GGACCCAGAC GCAGGCCGCC AACCACATCA ACACCCACGC CGCCCGCGTC GGCCTCGACC CGGACGGCGC CGCACCCATG ACCGGCCCGA AGCTGTCGGA GCTGGAGAAC TGGCCGTTGC CGAACAACCG CCGCCGGCCC ACCCCCCAGA TCCTCGCCCT GCTCGCCGAG GTCTACGACA CCAGCATCCA CAACCTGATC GACCTTGACG ACCGCGAACA GATGCCTCCT GCTGATCTGC TGCTCATCAC CACGATCCGT GAGAGGGCCG TGCCGGCGAG CGCTATCGGT TCGCTCCCAC AAGCGCAGTC GGCAAGCTCG GAGGTCACGC CGATGGTGGA TGCGCCGAAT CGGCAGCAGT TCCTGCTCGC GGCGTCAGCC TTGGGCGTTG CCGTGGTGCT GCCCCGACAG CCGGCCGCGC CGTCCCCGTC CGTGAGCGTC CGGTCGACCG TGTTCCCGGC CGCATGCGAT CTTCTCGCTG ATCTGCGAGA AGCCATCACG GCACCGGCGG AGTGGTCCAC CGATCCCGAC CCGGCCTCAT TCGCGGGCCC TGCCGACCTT GACGCTCGCG CGCGGGAGTG CCACGACCGC TACCAGCGGG CCGACTACGC AGGCACGGCG AGACTTCTCC CCGCGGTGGT GCGGGGCATC GAGACACTCA CGGTCGATCC GCCGACTGGC GTGAACCACC GCGCGGTCCG GCGGACGCAG GCCGTCGCGT ACATCGCTGC CGCCAAGCTC GCGACCAAGA CCGGCGACCA CGATCTCGCC TGGCTGGCCG CAGACCGTGG CCAACACGCG GCGCTCGCTG CCGACGCGCC AGCGCTACTG GCGACAGCGC GCAGACAGAT CGCCTGCGTC TTCCACGACA CGGGACGGCT GGCCGACGCC GAACGGGTCG CGGTCAGCGC CCTCGACGCC CTGAACCAGC GACCAGGCGA CGAGGACCAC CGCGACCTCT CGTCCGCGCG GGGCGCTCTT CTTCTGCTCT CGGCAATGAC CTCGATCCGC CGAGGCGAAC GGACGGAAGG CCGCCGCCGG CTCACCGCCG CGGCCGAGCA GGCTGACGCG CTTGGCCGGG ACAACAACCG GCTGTGGTCG GCGTTCGGGC CGACGAACGT CGCGATCCAC ACCCTTACCG CCACCCTGGT ACTGGACGAT CCGACAGAGG CGGTCGGCGT CGGCGAGCAG ATCGACACAC GTCTGCTGCC ACCCCCGCTG GCCGGCAGGC GCGCACGTCT GCACGTAGAT CTTTCCGGCG GGCATGCCCG CCTGGGCGAG GATGCCATCG CGGCGGTGCA CATCCTTGAC GTCGCCCGCC GGGCGCCGCA GCTGCTGAGG GTTGATCCGA CAGCTCGGGC TGTGCTGGCG ACACTGCTCG GCCGTGCCCG CGGCTCCACC GTCTCGGTCC TACGGAGTGT CGCGGAGCAG GCCGGAGTCG CAACGTGA
|
Protein sequence | MCGGVMSKLS RRVEQEELRA RMRAVGMSHD EIAVEFARRY KLRPRAAHRH ARGWTQTQAA NHINTHAARV GLDPDGAAPM TGPKLSELEN WPLPNNRRRP TPQILALLAE VYDTSIHNLI DLDDREQMPP ADLLLITTIR ERAVPASAIG SLPQAQSASS EVTPMVDAPN RQQFLLAASA LGVAVVLPRQ PAAPSPSVSV RSTVFPAACD LLADLREAIT APAEWSTDPD PASFAGPADL DARARECHDR YQRADYAGTA RLLPAVVRGI ETLTVDPPTG VNHRAVRRTQ AVAYIAAAKL ATKTGDHDLA WLAADRGQHA ALAADAPALL ATARRQIACV FHDTGRLADA ERVAVSALDA LNQRPGDEDH RDLSSARGAL LLLSAMTSIR RGERTEGRRR LTAAAEQADA LGRDNNRLWS AFGPTNVAIH TLTATLVLDD PTEAVGVGEQ IDTRLLPPPL AGRRARLHVD LSGGHARLGE DAIAAVHILD VARRAPQLLR VDPTARAVLA TLLGRARGST VSVLRSVAEQ AGVAT
|
| |
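The length fields in the record above can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon. The helper names `gene_length`, `protein_length`, and `gc_percent` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a sequence; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record for Francci3_3995.
length = gene_length(4779225, 4780832)
print(length)                  # 1608
print(protein_length(length))  # 535
```

Passing the full Gene sequence string to `gc_percent` gives a value to compare with the reported GC content. Because the Gene sequence field begins with ATG and ends with TGA, it appears to be given in coding orientation already, so no reverse complement is applied here even though the strand is "-".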