Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3349 |
Symbol | |
ID | 3905931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3971566 |
End bp | 3972927 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637880674 |
Product | hypothetical protein |
Protein accession | YP_482435 |
Protein GI | 86742035 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00321833 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.277743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGCC GTATAGATCA GCAGCGCCGG GCGTCCGCGC CCGGCGAGGT TCCCAACGCC CTGGAAGTCG ACGAGATCGA CGCTGCCGCG GCGCAGCGCC GGGCGTCCGC GCCCGGCGAG GTTCCCAACC TGGTCGCCTA CGTGGTGCCC GCCGAGGGGG AGAGGCAGCG CCGGGCGTCC GCGCCCGGCG AGGTTCCCAA CACCGGTTCG TTGACGAAAT GCAGCCCGAA TACGTCGCAG CGCCGGGCGT CCGCGCCCGG TGAGGTTCCC AACTGCACCG TTGACACATC CCCACTATTC GACACGGTCG AAGCAGCGCC GGGCGTCCGC GCCCGGCGAG GTTCCCAACG TCAACATCGC CGACGGATGG GCGCCGTCCT CGAACGCAGC GCCGGGCGTC CGCGCCCGGC GAGGTTCCCA ACGAGGTGTA CACCACCTCA GACACTCTCG ATGTCGTCGC AGCGCCGGGC GTCCGCGCCC GGCGAGGTTC CCAACTCCGC GAGGAGGTCG GCGAGGGTGG TCAGGCCATC GGCGCAGCGC CGGGCGTCCG CGCCCGGCGA GGTTCCCAAC GCCATCCTGC TGTACGGCGT CGCGACGAAG ATCGAGCAGC GCCGGGCGTC CGCGCCCGGC GAGGTTCCCA ACACCAGCTC CTCGGCGCCG GGGATGTCCG CTGGGAGCAG CGCCGGGCGT CCGCGCCCGG CGAGGTTCCC AACGACAGCT CCGAGATCGG CCCGCTCGGG GACCGGCCGG GCAGCGCCGG GCGTCCGCGC CCGGCGAGGT TCCCAACCCG ATCTGCTGGT CGACATCGCC CCCGAGGGGG AGCAGCGCCG GGCGTCCGCG CCCGGCGAGG TTCCCAACAC CTCGTGGCAG CAGGTCCAGG ACTACGTCGA GCAGCAGCGC CGGGCGTCCG CGCCCGGCGA GGTTCCCAAC CGCTCGACAG GTCGATCTCG TACAGGGCGC CGTCCAGCAG CGCCGGGCGT CCGCGCCCGG CGAGGTTCCC AACGATGGCG CACCGGCGCA GCTCGGCGTT GCGCTCGGCG AGCAGCGCCG GGCGTCCGCG CCCGGCGAGG TTCCCAACTA CTCCGGCACC GAGACCGCCG CGAACATCAC AGGCAGCGCC GGGCGTCCGC GCCCGGCGAG GTTCCCAACC CCGTGGTGGC GGCATCTGCC GCGATCGCGG CGGCGGGCCG TCGGGGGTCC ATTTCCCACC GAGGACGATG CGCAGAAGGC GCTGATGGCC TCGTTAGCGG AGCTTCACGG CGGCGACTGG CTGGAAGAGG CCTGACTCGA ACCCACGACC GACGGATTAT GAGATCATCG GCAGCCTGTT CGCCGGTCTC CATGGACGTC CGCCGGTCCT GA
|
Protein sequence | MTSRIDQQRR ASAPGEVPNA LEVDEIDAAA AQRRASAPGE VPNLVAYVVP AEGERQRRAS APGEVPNTGS LTKCSPNTSQ RRASAPGEVP NCTVDTSPLF DTVEAAPGVR ARRGSQRQHR RRMGAVLERS AGRPRPARFP TRCTPPQTLS MSSQRRASAP GEVPNSARRS ARVVRPSAQR RASAPGEVPN AILLYGVATK IEQRRASAPG EVPNTSSSAP GMSAGSSAGR PRPARFPTTA PRSARSGTGR AAPGVRARRG SQPDLLVDIA PEGEQRRASA PGEVPNTSWQ QVQDYVEQQR RASAPGEVPN RSTGRSRTGR RPAAPGVRAR RGSQRWRTGA ARRCARRAAP GVRARRGSQL LRHRDRREHH RQRRASAPGE VPNPVVAASA AIAAAGRRGS ISHRGRCAEG ADGLVSGASR RRLAGRGLTR THDRRIMRSS AACSPVSMDV RRS
|
| |