Gene Information |
Locus tag | Franean1_4142 |
Symbol | |
ID | 5672499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4924092 |
End bp | 4925363 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243017 |
Product | putative transposase |
Protein accession | YP_001508434 |
Protein GI | 158315926 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
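The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a gene length that includes the stop codon; the function name `check_lengths` is hypothetical and is not part of any database API.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check the coordinate, gene length, and protein length fields.

    Assumes 1-based inclusive coordinates and that the gene length
    includes the terminal stop codon (both are assumptions).
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    coding_codons = gene_len_bp // 3 - 1  # all codons minus the stop codon
    return {
        "span_matches_gene_length": span == gene_len_bp,
        "whole_codons": gene_len_bp % 3 == 0,
        "protein_length_matches": coding_codons == protein_len_aa,
    }


# Values taken from the record for Franean1_4142.
result = check_lengths(4924092, 4925363, 1272, 423)
print(result)
```

With the values from this record, all three checks pass: the span is 1272 bp, which is 424 codons, or 423 amino acids plus one stop codon.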
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.304536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTGCACGT TGCCGCGGCT GGGCTTCGTG CCGGACGACG TGACGTCGGC GCCGCGGGCG GCGATGGTAC GGCTGGCTGA GCGGTTAGGC GTCGACCCGG ACGTCATCGG TTCGTACGGA CGGCGGGCCA AGACGCGCAC GGATCATCTG CGGCTGGTGG CGCAGTATCT GGGGTGGCGG GTGCCGACGA CGCTGGACCT CAAGGAGCTG GACGAGTTTC TGTTGGCGCG GGCGATGGAA CACGACGCGC CGACGTTGCT GTTCCGGCTG GCGTGCGAGT ACCTGATCTC GGCGAAGGTG ATCCGGCCGG GTCCGGTGAC GGTGGTGAAG CGGGTCGCGC ACGCCCGCGA GGTCGCGCAG CAGGAGACGT TCGACCGGCT GGCGCACGAG TTCACCGGCG AGCGTCGCGT TGGGCTGGAC GGCCTGCTGG TGACCGACCC TGAGATCGGG CGCGCACACC CTGGATTGTC GGTGCTGCCG GCGGAACGGC GCAGGTTCCT GGCCACGGTG GGCCGCCGGC TGACGGCGCA GGCGTTACAG CGGCGCGAGC CGCAGCGTCG GTACCCAATC CTGCTGACCT TGTTGGCGCA GTCGGCGATC GACGTGCTGG ACGAGGTCGT GCAGTTGTTC GACCAGGCCG TCTCGGCGCG GGAAGCCAAG GCCGCGCACA GGATGCGCGA CGAGTTGGCC GAGCGCGGCA AGGCTGGCGA GGAGCGCCAG GCCCTGCTGG ACACGGTCTT GGCGATCGTC GCCGACCCGG CGATCCCGGA CGAGGACGTC GGCGGGCTGA TCCGCGGGGA GAAGGTGGGG TGGGAGCGGC TGCGTGCCGC GCAGGCCGCC GCGTTGCCAC CGCTGCCACG CGACCACGGG CACCTCGCGT CGCTGGACGG TTCCTACGGG TACCTGCGGC AGTTCACCCC ACAGGTACTG GACGCGGTGA CCTTCGCCGG CGGCACGGCC ACGGCCGACC TGCTCAAAGC GGTGGACATC CTGCGTGAGT TGAACGCCAC CGGGGCCCGC AAGGTCCCGG ATGACGCGCC GAGCGGCTTC GTCCCGGCCC GCTGGCGCGG CTACCTCGAC ACCGCGGCGA AGGCGGACAG TGTCACCGCC TACCGCCACT ACTGGGAGCT GTGCACCTTG CTGGCGTTAC GCGACGGGCT ACGTACCGGC GACGTGTTCG TGTCTGGCTC GCGCCGCTAC TCCGACCCGG CCGCCCACCT GCTCACCCCT GAGAAGTGGG CCGACCAGCG GGCCGATGGA CGCCGACGGT GA
|
Protein sequence | MCTLPRLGFV PDDVTSAPRA AMVRLAERLG VDPDVIGSYG RRAKTRTDHL RLVAQYLGWR VPTTLDLKEL DEFLLARAME HDAPTLLFRL ACEYLISAKV IRPGPVTVVK RVAHAREVAQ QETFDRLAHE FTGERRVGLD GLLVTDPEIG RAHPGLSVLP AERRRFLATV GRRLTAQALQ RREPQRRYPI LLTLLAQSAI DVLDEVVQLF DQAVSAREAK AAHRMRDELA ERGKAGEERQ ALLDTVLAIV ADPAIPDEDV GGLIRGEKVG WERLRAAQAA ALPPLPRDHG HLASLDGSYG YLRQFTPQVL DAVTFAGGTA TADLLKAVDI LRELNATGAR KVPDDAPSGF VPARWRGYLD TAAKADSVTA YRHYWELCTL LALRDGLRTG DVFVSGSRRY SDPAAHLLTP EKWADQRADG RRR
|
| |
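The gene sequence begins with TTG while the protein begins with M. Under translation table 11, TTG is an accepted initiation codon and is read as methionine at the start position. The sketch below illustrates this together with a GC-fraction helper. It assumes the sequence is given 5' to 3' in the coding orientation, as it appears in the record, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Only the first 30 nucleotides are used in the example.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 uses the same
# amino acid assignments as the standard code and differs in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the whitespace that splits the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence in the record.
prefix = "TTGTGCACGT TGCCGCGGCT GGGCTTCGTG"
print(translate(prefix))  # → MCTLPRLGFV
```

The translated prefix matches the first ten residues of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field, subject to rounding.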