Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3445 |
Symbol | |
ID | 5671816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4076633 |
End bp | 4077709 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641242333 |
Product | integrase family protein |
Protein accession | YP_001507753 |
Protein GI | 158315245 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.210447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGCCTGG CAGTGGTGCG GGATCTCCGC GAGGCGCGTG CGCCCGCGAG CCCGGACGAG GTCACGGAGT TCGAGACGGA TGTGCTGGCC GGGTTCGTGC TGGCGCGGGC CGCGGCGGGT CTGGCCGACA AGACGATCCG CTCCGACGTC GGACACCTGG AGCAGATCCG GTCCTGGTTC GGCTGTCCAT TATGGGAGAT GGCGCCTTCC GATGGGGATG TCTATTTTGG GCAGGTCCTG CGCCGCGCGG CGCAGGCAAC CCGGTCGTCC CGAGCGCAGG CACTGAAAAC CTACTTCCTG TTCCTGGAAC TGCGCCACAA AGTCGAGATC CACAACATGA CCGGCCACAT CGTGGCGTGC CCGATCGACG AGATGAACCG GCCGCGCGGG CAGACCGGTG CCTCGTTGCG TATACCGCCG AGCCAGGCAC AGATTGATGC GTTGTTTACC GGGTGGCGTG CCGAGTTGGC TACCTGCCGC AAGTTCGCCA CCGCCGCGCG TAACTACGCG GCGGCGCGGC TGATGTCCGA GGTGGGCCTG CGCGTCAACG AGGTGTGCAA CCTCGACCTG GACGACATCA AGTGGGACCT CGGCCGGTTC GGCAAGCTGC ACGTCCGGTG CGGTAAGGGC TCGCGCGGAA GCGGGCCGCG GGAGCGGATG GTTCCTCTCA TCCGGGGCGC GGACCGTACA CTGCGCTGGT TCGTCGAGGA CGTGTGGGGC CACTTCGACA ATGATCATAC CCGGCCCGGC AACCCGCTGT TCCCCTCGGA ACGCCGCAAC GGCGACGGAA CATGCGTACG CGTCGGCGAC GACGCCCTGC GCAACGGTCT CGTCGAGGCC GCATCCCGTC ATCTGCCGGA ATGGCAAAAC GCACTCACGC CGCACGTGCT GTGGCACTTC GCCGCCTCCC AGTTCTACCT GACGGGAATG GACCTGATCG CTATCCAGGA GGTCCTTGGT CATCGCTGGG TCGCAACCAC GATGCATTAC GTCCACGTGC ATCGCGGTCA CATCGAAGAC GCGTGGGTGG CCGGTCAGCA ACGGGCCGCC GACCGGCTGA AGGGACGGCT GCCATGA
|
Protein sequence | MGLAVVRDLR EARAPASPDE VTEFETDVLA GFVLARAAAG LADKTIRSDV GHLEQIRSWF GCPLWEMAPS DGDVYFGQVL RRAAQATRSS RAQALKTYFL FLELRHKVEI HNMTGHIVAC PIDEMNRPRG QTGASLRIPP SQAQIDALFT GWRAELATCR KFATAARNYA AARLMSEVGL RVNEVCNLDL DDIKWDLGRF GKLHVRCGKG SRGSGPRERM VPLIRGADRT LRWFVEDVWG HFDNDHTRPG NPLFPSERRN GDGTCVRVGD DALRNGLVEA ASRHLPEWQN ALTPHVLWHF AASQFYLTGM DLIAIQEVLG HRWVATTMHY VHVHRGHIED AWVAGQQRAA DRLKGRLP
|
| |