Gene Information |
Locus tag | Franean1_2098 |
Symbol | |
ID | 5670498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2522073 |
End bp | 2523653 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241019 |
Product | hypothetical protein |
Protein accession | YP_001506440 |
Protein GI | 158313932 |
COG category | [S] Function unknown |
COG ID | [COG2308] Uncharacterized conserved protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.655673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGGTCG CGGCAGCGGC CGCCTGGGAC GAGGTCTTCG ACGCGGCGCA TCGCCCCCGG GAGGTGTACA CCGCCCTGCA CGACGCGCTG CAGCCGCTGA GCAGCTCCGA CCTCGCGGCC CGCAAGATCG CGCTCGACCG TGCCTTCCGG GACGCCGGCA TCACCTTCAA CCTGTTCGGC GAGGAGCGGC CGTTCCCGCT GGACCTGGTG CCCAGGCTGC TCTCCTGCGA CGAGTGGGAC GTCATCGAGC GGGGCGTGAC CCAGCGGGTG CGCGCGCTCG AGGCGTTTCT CGACGACGTC TACGGGCGTG CCGACGTCCT CGCGGACGGC ATCGTGCCCC GCCGGCTGGT GCTGTCCAGC TCGCACTTCC ACCGTGCGGC GCACGGCATC GACCCGCCGA ACGGCGTCCG CGCGCATGTC AGCGGCATCG ACCTGGTGCG GGACGAGCGC GGCGACTTCC GGGTGCTCGA GGACAACGTC CGCGTCCCGT CCGGGGTCAG CTACGTCATC GAGAACCGGC GCGCGATGAC CCGGGTGTTC CCGGAGCTGT TCTCCACCCA CCGGGTGCGC CCGGTCGCCG ACTACGCCAC CCACCTGCTG CACGCGCTGC GCGCGGCGGC GCCACCGGAG GTCGCCGACC CGACCGTCGT GGTGCTCACC CCGGGCGTGT ACAACTCCGC CTACTTCGAG CATGCGCTGC TGGCCCGCCA GATGGGCGTG GAGCTGGTCG AGGGCCGGGA TCTCTCCGTC CGGAACAACC GGGTCACGAT GCGCACCACC GAGGGTGACC AGCCGGTGCA CGTTGTCTAC CGCCGGGTCG ACGACGACTG GCTCGACCCG CTGCACTTCC GTCCCGAGTC GATGGTCGGC TGCGCGGGGC TGCTCAACGC GGCCCGGGCT GGGAACGTGA CGATCGCGAA CGCGGTCGGC AACGGGGTCG CCGACGACAA GCTGATGTAC ACCTACGTCC CGGACCTCAT CCGTTACTAC CTCGGTGAGG AGCCGGCGCT CGGCAACGTC GACACCTTCC GGCTGGAGGA CCCGGACCAG CGCGCCCATG TGCTGGACAA CCTCGAGTCC CTGGTGGTCA AGCCGGTGGA CGGCTCCGGC GGCAAGGGGA TCGTGATCGG CCCGCAGGCG ACCGAGGCCG AGCTGGTCGC GCTGCGCGCG CGGGTGCTCG CCGACCCGCG CGGGTGGATC GCACAGCGGG TGGTGAAGCT GTCGACCTCC CCGACCCTGG CCGATGACCG CCTCGGGCCG CGCCACGTCG ACCTGCGGCC GTTCGCGGTG AACGACGGGA ACCGGATCTG GGTGCTGCCC GGCGGGCTGA CCCGGGTCGC GCTGCCCCGC GGCAGCCTGG TCGTGAACTC CAGCCAGGGC GGCGGTTCGA AGGACACCTG GGTGCTCGCC CCCGAGCGGG TCGGTCCGGA GGGGGCCGCG CTTCTGCGTC GGCGGCCGGG CCTGACGCCG TCGGTCGCCG CCGGGCCCGA CCTCGGCCCG CACTCGTCCG ACGAGCAGCA GCAACAGCAG TCCGAGCAGC AGAACCAGCA GCAGAACCAG CAGGGGGACG GGCTGTGTTG A
Protein sequence | MEVAAAAAWD EVFDAAHRPR EVYTALHDAL QPLSSSDLAA RKIALDRAFR DAGITFNLFG EERPFPLDLV PRLLSCDEWD VIERGVTQRV RALEAFLDDV YGRADVLADG IVPRRLVLSS SHFHRAAHGI DPPNGVRAHV SGIDLVRDER GDFRVLEDNV RVPSGVSYVI ENRRAMTRVF PELFSTHRVR PVADYATHLL HALRAAAPPE VADPTVVVLT PGVYNSAYFE HALLARQMGV ELVEGRDLSV RNNRVTMRTT EGDQPVHVVY RRVDDDWLDP LHFRPESMVG CAGLLNAARA GNVTIANAVG NGVADDKLMY TYVPDLIRYY LGEEPALGNV DTFRLEDPDQ RAHVLDNLES LVVKPVDGSG GKGIVIGPQA TEAELVALRA RVLADPRGWI AQRVVKLSTS PTLADDRLGP RHVDLRPFAV NDGNRIWVLP GGLTRVALPR GSLVVNSSQG GGSKDTWVLA PERVGPEGAA LLRRRPGLTP SVAAGPDLGP HSSDEQQQQQ SEQQNQQQNQ QGDGLC
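The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon (the gene sequence above ends in TGA). The function names are hypothetical and are not part of any database API. Under translation table 11, GTG is an accepted alternative start codon, which is consistent with the gene sequence beginning GTG while the protein begins with M.

```python
# Values copied from the Gene Information table in this record.
START_BP = 2522073
END_BP = 2523653


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1581 526
```

Both values agree with the Gene Length (1581 bp) and Protein Length (526 aa) fields reported above.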