Gene Information |
Locus tag | Francci3_4198 |
Symbol | |
ID | 3907163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 5011160 |
End bp | 5013094 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881526 |
Product | hypothetical protein |
Protein accession | YP_483275 |
Protein GI | 86742875 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
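The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(5011160, 5013094)
print(length_bp)                  # 1935, matching the Gene Length field
print(protein_length(length_bp))  # 644, matching the Protein Length field
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields listed in the record.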
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGC CCTCCAAGGT CGCCTTCGCC CTTCCCATAC CGGCTCTTGT GCGGGTGCGC GGAGCGATCG ACGAGGCCTG GCACGAGGCG CTCGCCGACC TTCCGGACGC TGTGGGCAGC ACCGTGTCCC TGGGCATCGG GTGGGACCTC GCCTGGGAGC GCGAGCAGTG GCGATCGGCT GTCGAGGACA GGCGCAGCCA CCTCTCCGTG CGCCTGTACG CGGACGAGAC TCTGATCGGC CCCCTGTGGG CGCCCGCGAC CGATGCCGGA TGCAGCGGAT GCGCCGAGGT CCGCTCCCGG GTGGTTCGCG CCCACCCGAT GGTCGAGGCA CTGCAAGTGC CCACCGAGGT CCCGCTTCCT CGGTCTCCCC TGCTGCCAGA GCTGCTGGCG GCCGCCGTAC AACACCTCAC CGCCCAGCCT CTCGGGCCCG GTGAGCTCTA CGCGGTCGGC ACCGGGTCAA CCCGCCGCCA CCGGGTTCCA CGCAGCTCCG GCTGCCCTCT GTGCGGCGCC CGCACCACTG GCACCGCAAC CGCTCCCCCG CCCGGCAGGC TGGTGCTGCA CGACCACCCG GCGGACCCTC ACGACCCCAC CCGGGGCCTC ACGGGCCGCC GGTTGCTCGC CCCCGGCGCC CTGCGCCGTC GTACCGTCGA CCCGCGGTTC GGCCCAGTAC GGGAAGTCGT CCGCGAGTCG CGCGCCCCCT ACGCGATGAG CATGGCGGCA CTGCCCAACG CACCCGCCAT GGGATACGCG CGGGCCGTCG ACTTCGAAAC GGCCGAACCG GTCGCGGTCC TGGAGGCTTA CGAGCGGCTG GGAGGCTTTC CGTACGAGGC TTCGGTGATC GAGGACGTGG CCTACCAGGA GGTCGCAGAA CACGCCGTGG ATCCCGCCTC ACTCGGCGGG TACACCGCGC AGCAACTGGC CCATCCGAGC ACTCGCGTGA CCCCGAGCTT CCCGTCCACG CCCATGGACT GGGTCTGGGG CCACGACCTC GCCACCGGCA GACCCCTGCT TGTCCCCGCG GACATCGGCT TCTACCAGTA CGACCACCGC TTCAAACGCT CGCACCATGC CGCGCAGCGA GCCGCCCCGC ACGACCGTCG CCGCTATTTC CACGACTCGT CGAGTGGCTG CGCGCTGGGC GGAAGCCTGG AAGAGGCGGC ACTGCACTCA CTGTTCGAAC TGGCCGAACG CGATGCCTTC CTCATTGCCT GGCACTGTGC CGTCCCGCTG CCCGCCATCG ACCCGGCCTC CATCACCGAC CCGGCCAGCC GTCGGTTGCT CGACCTGATC GACTCGCGGG GGTTCGACGC CCATCTGCTG GTCGCCACCC AGGACATCGA CCTACCCGTG GTGTGGGCGC TCGCCATGAA CCGTGAGCGG CACTTCCCGG CCACCTTCTC GGCCGCCGGG TCTGGCTGCA ATCCGGCGTC CGTGGTGCGC AGTGCCCTGT GGGAGCTCGG CCAGATCGTC ACCGACCCGG TCACCTGGAC CAGAGCCGAC ATCGAGCCCA TGCTCGCAGA CCCCTGGCTG GTCGAGGAAC TCGACGACCA CCTGCGGCTC TACACCCTTC CTCAGACGCT CGGACGGGTC ACCCCGGTGC TTGGTGGTCT GCGGGTCCCT CTCGACGAAG CGTTTCCCGG ATGGCCCGAC CGGCTGCGCG AGGAGGCGAA GGGCAGCGTG CTCAGGGCGC TGCGAGCCAT GCAAGAACGT TTCGCCCGCG CCGGTCTGGA CCGGATCGTG CTGGTCGACC AGTCCACCCG GGAACACCGG GACCTTCAGG TCGCCGTCGC CAAGGCGGTG 
GTGCCGGGAA TCATCCCCAT GTGCTTCGGC CACGCGCAGC AGCGGCTGCT GGGCCTGCCC CGGCTGACCG CAGCGCTCGC GGGCACGCCA ACGGCCGACC GGCCCTGCCC TTATGACCCT CATCCGTTCC CGTGA
|
Protein sequence | MPEPSKVAFA LPIPALVRVR GAIDEAWHEA LADLPDAVGS TVSLGIGWDL AWEREQWRSA VEDRRSHLSV RLYADETLIG PLWAPATDAG CSGCAEVRSR VVRAHPMVEA LQVPTEVPLP RSPLLPELLA AAVQHLTAQP LGPGELYAVG TGSTRRHRVP RSSGCPLCGA RTTGTATAPP PGRLVLHDHP ADPHDPTRGL TGRRLLAPGA LRRRTVDPRF GPVREVVRES RAPYAMSMAA LPNAPAMGYA RAVDFETAEP VAVLEAYERL GGFPYEASVI EDVAYQEVAE HAVDPASLGG YTAQQLAHPS TRVTPSFPST PMDWVWGHDL ATGRPLLVPA DIGFYQYDHR FKRSHHAAQR AAPHDRRRYF HDSSSGCALG GSLEEAALHS LFELAERDAF LIAWHCAVPL PAIDPASITD PASRRLLDLI DSRGFDAHLL VATQDIDLPV VWALAMNRER HFPATFSAAG SGCNPASVVR SALWELGQIV TDPVTWTRAD IEPMLADPWL VEELDDHLRL YTLPQTLGRV TPVLGGLRVP LDEAFPGWPD RLREEAKGSV LRALRAMQER FARAGLDRIV LVDQSTREHR DLQVAVAKAV VPGIIPMCFG HAQQRLLGLP RLTAALAGTP TADRPCPYDP HPFP
|
| |