Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2617 |
Symbol | |
ID | 5671011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3098271 |
End bp | 3100142 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641241533 |
Product | hypothetical protein |
Protein accession | YP_001506953 |
Protein GI | 158314445 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.129951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGCG GTGCCGGCCC GGGTGGTCGC CGCGCGACCC AGCCGAACGC AGGCTGGCGT CCAGCGGATC GGCTGCCGCC GGACACGGAC GGCGTGCGGG TGCGGTTCGT CGAGGAGTCC GGATGGCGGT CGAAGGTGTT CGAGTTCGCG GGTCTGCCGG TCGAGCCGGA TGTTCAGCGG TGGCTGGCGC GGGTGTTCGC CCGCCGGGCC GGTCCGCGCT CCGCGACGAA ACGGATCGCT ACGGCCGCCG GGCACTTCCA TGTCCTTCAG ATCTTCGCCG CGTTGCTCGC TGAGACGTCT ACGCCGCCGC GCGGCCCCGC CGACCTTCGG CCGGCGCACA TCAGCGCGTT TCTGCTCCGC TACGGCGGCC AGCGGTGCCA GCGGGAGTAC CTCAAACGGC TGCGGGTGCT GCTCCGCGAC GACCCCGAGC TGCCCGAACT GACCCGGACG GCGTTGCTGT CCGAGCGGGT CGCGCCGCCG GCCGAGAGCA GCGCGGTCGT CGGCTACAGC GACGCAGAGT GGCAGCAGAT CATGACCGCG GTGCGGCGGG ACTTGCGGCT CGCCCGGGAC CGCATCCGGG ACGGCCGGCG GCTGCTCGAC CGTTTCCGCG CCGGCGGTGT CCCGCCCGTC AGTCGGGGCG CCGAATTCGG CCTCCTGCTG GACGTCTTCG ACCGCACCGG CGATCTTCCC CGGGTCAGCT CGGGCCAGCA TTCCCGTGCT GTCCTGCGCG CCGGTGGCAT GACCGCGGTC GGCGGGCGGC TGTGCCTGTC CAGCGACGAG GCGGTCGCGT TCTGCCTGCT GCTGGTCGCG TTGACCGGGG AGAACTTCGG CACCGTCGCG GCCTGGCCGG CGGTGTGCCA CCGGCCGGAC GGCGCCGACA GCGACACCGG TGTCGCGCTC GTGGAGGCGG TCAAACCCCG GCGGGGACCC GACCGCGAGC ATATGATCAC TGTGTTGGAG GACGTGCCCG CCGGCCTGGC CGAAGTACTG GACACCCCAG GTGATGATCA TCGACTGTTC CGCTCTCCGC TACGGGTCTA CCGGCTGCTG GTCGAGCTCG GCGAGGTCTC GTGCCGCCAC GGCGGCCACC ACGGCGCGGT GAGCGCGTTC GTCCCCCGGC CGGGCAAGTT CGGCTCCCGC TGGGTCGAAG GGGCCAACGC CCAGGACCTG GTCGGCTGGG CTCGGCGGCG CGGCTTCCCC GCCGCCGCCC ACGCCGGACC GGACACGAAA CCGGCGGTGC ATGTGGGGCG CCTGCGCCAG AGCGTGATCG AACGTCGCTG TCAGCCCGTC GCGCACAGCC GGCACACGAT GAACGACCAC TACCTGCGGC GCAGCCACAC GGTCCGGGAC GACAGCCGTG TCGTGGTCGG CGACGCGCTT CGGGAGCAGG TCGACAAGGC GCGGGCGACC CAGAGCATCC CGGTGTTTAC CGCGGACTTT CTCGCCGACG CCCGCCGCGA TCTCGTCACG GCCGCGGCGA AAGCCGGGGT CGACCCGGAC ACTCTGCGGG GTCTGATCGC GGGAGCGCAA GACACCGCCC TCGCCTCCTG CACCGAACAT CGGAATGGTC CGCACGTCGC GCCGGGCCAG CCGTGCCCGG CGTCGTTCCT GGACTGTCTG GACTGCCGGA ACGCCCGCGC CCTGCCCCAC CAGCTCGGTG TCCAGATCGT CGCCGTCGAC CGGATGTGCG CGCTGCGCCC GCACCTCGAC CCCGCCGCCT GGACGGCGCG CCTGGGCCGG CGTCTCGACC AGTTGGAGGA GATCCTGAAC CACTACACCC GCGCCGAACG TGACCGCGCA CGGGAGACCG TGACCGACCG GCAAAGGCAG CTTGTGGGCG AGCTCCTCGA CGGCCGCTGG GACCTGCGAT GA
|
Protein sequence | MSGGAGPGGR RATQPNAGWR PADRLPPDTD GVRVRFVEES GWRSKVFEFA GLPVEPDVQR WLARVFARRA GPRSATKRIA TAAGHFHVLQ IFAALLAETS TPPRGPADLR PAHISAFLLR YGGQRCQREY LKRLRVLLRD DPELPELTRT ALLSERVAPP AESSAVVGYS DAEWQQIMTA VRRDLRLARD RIRDGRRLLD RFRAGGVPPV SRGAEFGLLL DVFDRTGDLP RVSSGQHSRA VLRAGGMTAV GGRLCLSSDE AVAFCLLLVA LTGENFGTVA AWPAVCHRPD GADSDTGVAL VEAVKPRRGP DREHMITVLE DVPAGLAEVL DTPGDDHRLF RSPLRVYRLL VELGEVSCRH GGHHGAVSAF VPRPGKFGSR WVEGANAQDL VGWARRRGFP AAAHAGPDTK PAVHVGRLRQ SVIERRCQPV AHSRHTMNDH YLRRSHTVRD DSRVVVGDAL REQVDKARAT QSIPVFTADF LADARRDLVT AAAKAGVDPD TLRGLIAGAQ DTALASCTEH RNGPHVAPGQ PCPASFLDCL DCRNARALPH QLGVQIVAVD RMCALRPHLD PAAWTARLGR RLDQLEEILN HYTRAERDRA RETVTDRQRQ LVGELLDGRW DLR
|
| |