Gene Information |
Locus tag | Franean1_4201 |
Symbol | |
ID | 5672556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5002147 |
End bp | 5004084 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641243074 |
Product | alpha/beta hydrolase domain-containing protein |
Protein accession | YP_001508491 |
Protein GI | 158315983 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0657] Esterase/lipase |
TIGRFAM ID | |
|
|
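The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(5002147, 5004084)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1938 645
```

Under these assumptions the computed values agree with the listed Gene Length (1938 bp) and Protein Length (645 aa).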
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.538787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.933486 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCGGG TTTTCGCCGC GCCGGCTTGC CCGGCCGCCG CGGCCGGGTC ACACTGCGAC CCAGTGGCCA CGCCCACACC CCGCGCCGCC CGCTCCCACC CGGCGACGCG CGTGGTGATT GGGGAGGGGG AGCATGTTCG GTCGCCGCAG GACCACCGTG GACGAGCATC CATCCGACCA CGCGGCGCGC GGGCCGGCAC CTGCCGCCTT GGACGGCCCC GGCCCGTACG ACGTCTCCGA GCTCGCCTGG CCCGGCGAGG CAGGCGAGGC GGCGCACGAG GCGCCGACCA ACCCGCCCGG CGACGAGGCC GGTTCCGGCG ACGCCGTCCC GGGTGTCGGA GCGGTACCCG GTGGGGCGGG GCGGCGCGCG CCCGAGATTC TCGACCTCGG CGCGCTGCGC GTCGCGGTGC CGCCGGGAGT GACCGTGCGG ATCGCCCGCG GCACGCAGAC CGGGCCGGGC ACCGACCTCG TCCTGACGGC CGCCGGGCTC ACCGTGCGGG TGACCGTGTT CGCCGCGCCG ACCAGCGGGC AGCTGTGGGC CAGAGTACGC GCCGAGCTGG CCGCCGGCCA CCCGGAGTCG GAGGTCCGCG ACGGCCCCCA CGGCCCGGAG CTGCTGCTCC CGGGCTGCCG GGTCGTCGGA GTCGACGGCC CGCGCTGGTT CCTGCGGGCG GTCGTGACCC CGGCCGACGC CGACGGGGCC GACGAGATCC TGCGCGGCCT CGTCGTCGTA CGCGGCCCCG GTGCCATGCC GGCGGGCACG GCCCTGCCGC TGCTGCCGGT CGGGCCGGCC GGCAGCCGCC CGAACGCCGG AGCCCTCGAC GACCTGTGGA CCGACGAGCC GCTGACGGCC GGCACCGGAA CCCTCGAGGG TCGCCGCGCC GTGCACGAGT ACGCGGCGGC CTTCACCGGC GACCTCAGCC GCAACCTGTC GACCTGGGGC TGAAGCCCCG CCGGGCTGTG GGGCATGATG CAGCCGGGGG TGGGTCGTGC CATGCCCGTC TCCCCCGTCT CCCGACCCGA CTCGACCTTG ATGGCGGCGG TGTGCGCACC GATCAGGACC TGGCCTACGC GACGCTGTCC GACGCGCAGC GTCTCGATCT GCTCCTGCCC ACGGGCGGCG GCGATCCGCC TCCGGTCGTC GTCGCGATCC ACGGCGGCGG GTTCGCCGTC GGCGACAAGC AGGACATGGC CCGCACCGCG CACGCGCTCG CCGGCGCGGG CTACGCGGTG GCCAGCGTCA ACTACCGGCT CTCCGGTGAG GCGGCCTTCC CGGCGGCGGT CGCCGACGTC CGGGCCGCGG TGCGCTGGCT GCGGGCGAAC GCGCGCCGCC TCGGGCTGGA TCCGGCCCGG ATCGGGGTGA TCGGCGAGTC GGCCGGCGGC TATCTCGCCG CCATGCTCGG CGCCGCCGGC GACGACCCGC TGCCGGGGGA CGTCGACCTG GGCCCTGCCG TCGGCCTGGA CCCTGGCGTG GACCTCGGCC CGGCCGGAGC GCGGCCGTCC AGCGCGGTGC GGGCGGTGGT CGACCTGTAC GGCCCGGTGG ACTTCTCGAC CATGGACGCC CAGCTGCGCG CGAATCCGCG CTGCCCGGCC CGGGCCGCCT CGCACGACCG CGCGGACTCC CCCGAGTCGC GTTTCCTCGG CGCGCAGATC ACCGCCGCGT CGGAGCTGGT GCGCCTGGCC AGCCCGCTGT CCCACCTGCG CCGTGACCGC CCGCCGCCGC CGTTCCTGAT CGAGCACGGC GACACCGACT GCACCGTCCC CTACCAGCAG TCGCAGCAGC TCGCGGACGG CCTGTGCGCC 
GCCGGCGGGT CGGTCGAGCT CACCCTGCTG CGGGGGGTGG GCCACGGCGG GGCCTTCCCG CTCGCCGAGC GCCTGCCAGG CATCATCCAG TTCCTGGACC GCGCTCTGGA CCGCGCTCTG GACCACGCCC CGCGCTGA
|
Protein sequence | MTRVFAAPAC PAAAAGSHCD PVATPTPRAA RSHPATRVVI GEGEHVRSPQ DHRGRASIRP RGARAGTCRL GRPRPVRRLR ARLARRGRRG GARGADQPAR RRGRFRRRRP GCRSGTRWGG AARARDSRPR RAARRGAAGS DRADRPRHAD RAGHRPRPDG RRAHRAGDRV RRADQRAAVG QSTRRAGRRP PGVGGPRRPP RPGAAAPGLP GRRSRRPALV PAGGRDPGRR RRGRRDPARP RRRTRPRCHA GGHGPAAAAG RAGRQPPERR SPRRPVDRRA ADGRHRNPRG SPRRARVRGG LHRRPQPQPV DLGLKPRRAV GHDAAGGGSC HARLPRLPTR LDLDGGGVRT DQDLAYATLS DAQRLDLLLP TGGGDPPPVV VAIHGGGFAV GDKQDMARTA HALAGAGYAV ASVNYRLSGE AAFPAAVADV RAAVRWLRAN ARRLGLDPAR IGVIGESAGG YLAAMLGAAG DDPLPGDVDL GPAVGLDPGV DLGPAGARPS SAVRAVVDLY GPVDFSTMDA QLRANPRCPA RAASHDRADS PESRFLGAQI TAASELVRLA SPLSHLRRDR PPPPFLIEHG DTDCTVPYQQ SQQLADGLCA AGGSVELTLL RGVGHGGAFP LAERLPGIIQ FLDRALDRAL DHAPR
|
| |
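The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below is one possible way to strip the spacing, compute a GC fraction, and translate with the amino-acid assignments of NCBI translation table 11. It is an illustration under stated assumptions, not the method the database itself uses. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table for NCBI translation table 11 (bacterial, archaeal, plant plastid).
# Its amino-acid assignments match the standard code; codons are ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Alternative initiation codons allowed by table 11; an initiator is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon (here GTG) is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "GTGACCCGGG TTTTCGCCGC GCCGGCTTGC"
print(translate(prefix))  # MTRVFAAPAC
```

The translated prefix matches the first block of the listed protein sequence, including the GTG start codon read as M. Applying `gc_fraction` to the full gene sequence would give a value comparable to the GC content field.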