Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2158 |
Symbol | |
ID | 5670558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2587512 |
End bp | 2589275 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641241079 |
Product | heparinase II/III family protein |
Protein accession | YP_001506500 |
Protein GI | 158313992 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACA ATGCGGGTCG CTACCTGCGC ACCGTCGCGC TGCTTCGACC CGAACAGACG GTTGCCCGCG TCCGGTTGAG GACACAGCGA GCCGCGCCAA ATCAGTTTCC GGTGGTCTCG GAACGAATGC TCGCGCTCGC CGCGCGGCAG TGGACACGCC CGCTTGGCGG GGCCGGGTGG CCAGCCGAGT TTCTTCCGTT TGACGCAGGG CTGGACGACG TCTGGCCGCC GGCCGATGAG CTCGCTGCCG GGCGTTTCAC GCTTCTGGGC CACAGTCGGG ATCTCGGCGA GCCGGCCGAC TGGATCCAAC GCAACTCGCC TACGCTGTGG CGGTACCACC TGCATTACTG GGACTGGGCG TGGTCTCTGG CACTGCATCC TGAGCGCCGA TGGGCGAGCG GGATCTTCCG TGCGCTATAC CGGTCCTGGC GGGAGAGTAC CTCCCCCGGA CGGGGCGACG CCTGGGCACC CTATGTGGTC TCGCTGCGGA TCTGGTCCTG GTGCGGACTT GTCGACCGGC TGTCCGGCGA CACCGCGACA GCCTCCGCGA TCTGGACCGA TCTTGCCCAG CACATGGTCT TTCTGCGACT GCACTTGGAG ACCGACGTCG GCGGCAACCA CTTGGTCAAG AACCTCAAGG CCCTGCTGGG TGCCGCCGTC GCCTTCCACG ATCCCCAGGC CGTGGACCGC TGGACCGGGC GGCTCGTCGA CGAGGTCGAA CGCCAGATCC TGCCTGACGG GGGCCATACC GAACGCGCGC CCGCCTACCA CTGTCAGGTA CTCGCCGACC TCGACGACGT GGCCGGTCTG CTCGGCGCCG CGGGGCTGCA TGTACCCGAC GAGATCATCG ATGCGCGATA CCGGATGCGG GCCTGGCTCG GCAGCGTGTT GTCACCTGAT GGCGCTGTTC CACTTCTGAA CGACGGCTAC CCGGTTCCTA CCACTGCAGT GACCGTGCTT TGCCCCAGTC TCCCTGGTGA GGAGCCGGAG GCGGACAGGA TCAAAGGTCC GGACGACGTT CGGGAGCAGG GGGTCGGGCT CACGCTGCTA CCCTCATCCG GCCTGGCCGT GCTGCGCGCC GATCGGTGGC GGGTACTGGC AGACATCGGG CTTCCCTGCC CGGACGACCT CCCGGCGCAT GCCCATGCGG ACACGCTCTC CTTCCTCCTC TGGCACGATG GCTGGCCGCT ACTCGTCGAG GTCGGAACAT CGACATACGC CCTTGGGGCA GTCCGGGCCG CCGAACGATC CACCTCGGCG CACAACACCG TCGTCATCGA CGGACACGAC TCCACAGAGG TATGGGGCGC CTTCCGCGCC GGACGCCGGG CCCGGGTCGT TCTCGGGCCG GTCCGCTGCG ATGACGCTGG CGGCAGGCTG GACCTGACCG CATCGCACGA CGGCTACCGC TGGCTGCCTG GACGGCCCAT GCATGCCAGG GCCTGGACGC TGGACGGCAA GGGACTTCAG ATCACCGATC GGGTCACCGG TGCCGGCGAG CATCAGATCG AAATCCTCTT CCATCTCGCG GCCGGCTGGC GGGCCCGGCC GGGGCGTCAC GGACTGACGA TCGAGCACCA CCGGGGAGTC GGTCCGTTCC GCCTGGTCGC GGTCGGGCCC GGCACCTGGC ACACCCGTGA GGGCCAAGTG GCGACCGGTT GGCAGCGGAC CACGCCGGCG ACGGTCGTTG CTTACTGTCT GAATGCACGG CTGCCGGTCG AGGTTCACAC CCAGATAGCC GTAGGGACGG CCGACAATGA CTGA
|
Protein sequence | MIDNAGRYLR TVALLRPEQT VARVRLRTQR AAPNQFPVVS ERMLALAARQ WTRPLGGAGW PAEFLPFDAG LDDVWPPADE LAAGRFTLLG HSRDLGEPAD WIQRNSPTLW RYHLHYWDWA WSLALHPERR WASGIFRALY RSWRESTSPG RGDAWAPYVV SLRIWSWCGL VDRLSGDTAT ASAIWTDLAQ HMVFLRLHLE TDVGGNHLVK NLKALLGAAV AFHDPQAVDR WTGRLVDEVE RQILPDGGHT ERAPAYHCQV LADLDDVAGL LGAAGLHVPD EIIDARYRMR AWLGSVLSPD GAVPLLNDGY PVPTTAVTVL CPSLPGEEPE ADRIKGPDDV REQGVGLTLL PSSGLAVLRA DRWRVLADIG LPCPDDLPAH AHADTLSFLL WHDGWPLLVE VGTSTYALGA VRAAERSTSA HNTVVIDGHD STEVWGAFRA GRRARVVLGP VRCDDAGGRL DLTASHDGYR WLPGRPMHAR AWTLDGKGLQ ITDRVTGAGE HQIEILFHLA AGWRARPGRH GLTIEHHRGV GPFRLVAVGP GTWHTREGQV ATGWQRTTPA TVVAYCLNAR LPVEVHTQIA VGTADND
|
| |