Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3838 |
Symbol | |
ID | 5672201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4560253 |
End bp | 4561365 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242716 |
Product | integrase catalytic region |
Protein accession | YP_001508136 |
Protein GI | 158315628 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.167789 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCGCCG AACGAGGCCC GCGTCGGGCA GGCTGGACGG CCATGGCGTG GTCGCGGCTC TACGCCCTGA CGCGCAACGC TCTCGGACTG ATGTTGCTCC GCGTGCGTGG GGACACCTCG AAAGAGGTGG AGCTCCTCGT CCTGCGACAT CAGGTGGCGG TGCTGCGACG GCAGGTGAAC CGGCCGGCGC TGGAACCGAA GGATCGGGTG ATCCTCACGG CGCTGTCCCG GCTGCTACCC CGCGCCCGCT GGGACGTCTT CGTCGTCACC CCGGCCACCG TCTTGCGCTG GCATCGTGAC CTCCTCGCAC GACAATGGAC CTACCGGAGC AAGAAGCCCG ACCGGCCACC GATCCGACAT GAGATCCGTG AGCTGGTCCT GCGCCTCGCG CGGGAGAACC CGACCTGGGG CCACCGCCGG ATCCAAGGAG GACTCGCCGG GCTGGGCTAC CCGGTCGGAG TCGCCACCGT CTGGCGGATC CTGCACCACG CCGGTGTCGA CCCCGCGCCC CGCCGGGCCG ACGCCTCCTG GCACACATTC CTGCGCGCAC AGGCCTCCGG CATACTGGCC TGCGACTTTT TCGCGGTGGA CACCGTGTTC CTGCAACGGA TCTACGTGTT CTTCGTCGTG GAGATCGCCA CCCGCCATGT CCATGTCCAT GTCCTCGGAG TCACGAAACA CCCGACCGCG GCCTGGGTCA CCCAGCGTGC ACGGACCCTG CTGATGGATC TCGAGGACCG CGGCCGCCGG TTCCGGTTCC TCATCCGTGA CCGCGACACG AAGTTCACAG CTTCCTTCGA CACCGTCTTC ACGGCAGCCG ACATCGACGT GGTACGCACG CCCCCGCAGT CGCCCCAGGC AAACGCGATC GCGGAACGCT GGGTGGGCAG CGCCCGCCGC GAATGCACCG ACAGACTGCT GATCGTCTCC GAACGGCACC TGACGTCAGT CCTCACCAGC TACGCCGAGC ATTTCAACAC CCACCGGCCT CACCACTCCC TCGGCCAGCA CCCACCCGAC CCGCCACCCG TGGTCGCCCC GCCCCTGGGT TCCACCGTCC GTCGCACACG CATCCTCGGC GGGCTGATCA ACGAGTACCG TAACGCCGCC TGA
|
Protein sequence | MPAERGPRRA GWTAMAWSRL YALTRNALGL MLLRVRGDTS KEVELLVLRH QVAVLRRQVN RPALEPKDRV ILTALSRLLP RARWDVFVVT PATVLRWHRD LLARQWTYRS KKPDRPPIRH EIRELVLRLA RENPTWGHRR IQGGLAGLGY PVGVATVWRI LHHAGVDPAP RRADASWHTF LRAQASGILA CDFFAVDTVF LQRIYVFFVV EIATRHVHVH VLGVTKHPTA AWVTQRARTL LMDLEDRGRR FRFLIRDRDT KFTASFDTVF TAADIDVVRT PPQSPQANAI AERWVGSARR ECTDRLLIVS ERHLTSVLTS YAEHFNTHRP HHSLGQHPPD PPPVVAPPLG STVRRTRILG GLINEYRNAA
|
| |