Gene Information |
Locus tag | Franean1_3035 |
Symbol | |
ID | 5671415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3568885 |
End bp | 3569952 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641241934 |
Product | integrase catalytic region |
Protein accession | YP_001507354 |
Protein GI | 158314846 |
COG category | |
COG ID | |
TIGRFAM ID | |
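
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3568885
end_bp = 3569952

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1068 0 355
```

The result matches the listed gene length of 1068 bp and protein length of 355 aa.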
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTAC GCCTGCTCTA TCTGATCTTC GTGCGGGTAT GTGGCTGGCT GGTTCTCCTC GGCCGCTCGT CGGCATCGAA GGACATCGAG CTGCTCGTGC TGCGGCACGA GGTCGCGGTG CTGCGCCGCA CCCAGCCCAA GCCCCGGTGG GACTGGGCAG ACCGGGCGGT CCTCGCCACA CTGATCCGAC TCCTACCCAG GGCCCTGCGA GCGCACCGGC TGGTCACCCC CGGCACCGTC CTCGGGTGGC ACCGCCGTCT CATCACACGG AAATGGACCC ACCCGCAGCG GACCGGACGG CCACCGATCA GCCCGGAGAT CGCCACGCTG ATCAAGCGGC TCGCGACCGA GAACACGACG TGGGGCTACC AGCGAATCCA GGGCGAGCTC CTCAAGCTCG GCCACCGGGT CGGTGCGTCC ACGATCCGCC GGGTCCTGAA GTCCCTGGGT CTCCCGCCGG CGCCCAGGCG GCAGACCGAC ACGACCTGGC GGCAGTTCCT ACGCGCCCAA GCCTCGACCA TGCTGGCAGT CGACTTCTTC CATGTGGACT GCGCCGTGAC GCTGCGGCGT CTGTACTGCT TCTTCGTCCT GGAGGTCGGC TCCCGCACCG TGCACATCCT CGGGGTCACC GCCCACCCGG ACGGGTCGTG GACCACCCAG CAGATTCGGA ACTTCCTGAT GGACCTCGGC GACCGGGCAG GCGACTTCCA GTTCCTGGTC CGCGACCGGG CCGGACAATT CACCGCCTCC TTCGACGCGG TCCTCGCCGA CGCCGGCATC ACAGCCGTCA AGATCCCACC CCGAACCCCA CGGGCGAACG CCTACGCTGA GCGGTTCGTC CGCACCGTCC GGACCGAGGT CACCGACCGG ATGCTGATCT TCGGCGAACG GCACCTGCGT ACCATCCTGG CCGAGTACGC GGCCCACTAC AACGGACGGC GACCCCACCG CAGCCGCGAC CTTCAACCAC CCCGACCCGA CCACCCCATC GCAAACCTGA CCAAGGAACG GATCAAACGT CGGCCTGTCC TCGGCGGCTT GATCAACGAA TACGAACGAG CTGCCTAA
|
Protein sequence | MSVRLLYLIF VRVCGWLVLL GRSSASKDIE LLVLRHEVAV LRRTQPKPRW DWADRAVLAT LIRLLPRALR AHRLVTPGTV LGWHRRLITR KWTHPQRTGR PPISPEIATL IKRLATENTT WGYQRIQGEL LKLGHRVGAS TIRRVLKSLG LPPAPRRQTD TTWRQFLRAQ ASTMLAVDFF HVDCAVTLRR LYCFFVLEVG SRTVHILGVT AHPDGSWTTQ QIRNFLMDLG DRAGDFQFLV RDRAGQFTAS FDAVLADAGI TAVKIPPRTP RANAYAERFV RTVRTEVTDR MLIFGERHLR TILAEYAAHY NGRRPHRSRD LQPPRPDHPI ANLTKERIKR RPVLGGLINE YERAA
|
| |
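
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string might be cleaned, translated, and checked for GC content follows. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TAA), and it uses the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs in the start codons it permits). The function names `clean`, `gc_content`, and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTCCGTAC GCCTGCTCTA TCTGATCTTC"
print(translate(first_blocks))  # prints: MSVRLLYLIF
```

The ten residues produced match the first block of the listed protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field in the table.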