Gene Information |
Locus tag | Franean1_0402 |
Symbol | |
ID | 5668826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 480893 |
End bp | 481999 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641239335 |
Product | integrase catalytic region |
Protein accession | YP_001504774 |
Protein GI | 158312266 |
COG category | |
COG ID | |
TIGRFAM ID | |
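
The coordinate and length fields above agree with each other, and a short check can confirm this. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(480893, 481999)
print(length, protein_length_aa(length))  # 1107 368
```

The strand value does not affect these lengths, since start and end are given in replicon coordinates in either orientation.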
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.591562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGTCAGCGG GGCGGGACCC GTGTCGGGCA GGCTGGATGG TCATGGTGTG GTCGCTGCTC TACGCCCTGA CACGCAACAC TCTCGGGCTG ATGCTGCTCC GCGTGCGCGG GGATGCTGCG AAGGACGTCG AACTCCTCGT CCTGCGGCAC CAGGTGGCGG TGTTGCGACG GCGGGTCCAT CGTCCGGCAT TGGAACCGGC GGATCGGGTG ATTCTCGCAG CCCTGTCCCG GCTGCTACCC CGGGCCAGTT GGGACATCTT CTTCGTCACC CCGGCCACCG TGCTGCGCTG GCACCGTGAG CTCCTCGCAC GAAAATGGAC TTACCCGCGC AAGACGCACG GACGGCCGCC GATCCGCCGG GAGATCCGTG AGCTGGTTCT GCGTCTCGCG CGGGAGAACC CGACCTGGGG CCACCGCAGG ATCCAGGGCG AGCTCGTCGG GTTGGGTTAC TCGGTCGGGG TCGCCACCGT CTGGCGGATT CTGCACCGCG CCGGCGTCGA CCCCGCACCT CGGCGGGCCG ACACCTCCTG GCGCACGTTC CTACGCGCCC AGGCCTCCGG CATCCTGGCC TGCGACTTCT TCACCGTGGA CACCGTGTTC CTCCAACGGA TCTACGTGTT CTTCGTCGTC GAACACGCCA CCCGCCGTGT TCATGTCCTC GGGGTCACGA AGCACCCGAC CACGGCGTGG GTCACCCAGC AGGCACGGAA CCTGCTAATA GATCTCGAGG AGCGCAGCCA CCGGTTCCGG TTCCTTCTCC GTGACCGTGA CACGAAGTTC ACGTCCTCGT TCGACGCTGT CTTCACTGGG GCCGGTATCG ACGTGGTGCG CACACCACCG CAAGCCCCGC AGGCGAACGC GATCGCGGAA CGCTGGGTCG GCACCGCCCG CCGGGAATGT ACCGACAGGC TGTTGATCGT CTCCGAACGG CACCTGACGT CAGTCCTCGA CAGCTACGCC GAGCATTTCA ACACCCACCG GCCCCACCGC TCCCTCGGCC AGCACCCACC CGACTCGCCA CCCGTGGTCG CCCCGACGTC GGAGTCCACC GTCCGTCGCA CACGCATCCT CGGCGGGCTG ATCAACGAGT ACCGCAACGC CGCCTGA
Protein sequence | MSAGRDPCRA GWMVMVWSLL YALTRNTLGL MLLRVRGDAA KDVELLVLRH QVAVLRRRVH RPALEPADRV ILAALSRLLP RASWDIFFVT PATVLRWHRE LLARKWTYPR KTHGRPPIRR EIRELVLRLA RENPTWGHRR IQGELVGLGY SVGVATVWRI LHRAGVDPAP RRADTSWRTF LRAQASGILA CDFFTVDTVF LQRIYVFFVV EHATRRVHVL GVTKHPTTAW VTQQARNLLI DLEERSHRFR FLLRDRDTKF TSSFDAVFTG AGIDVVRTPP QAPQANAIAE RWVGTARREC TDRLLIVSER HLTSVLDSYA EHFNTHRPHR SLGQHPPDSP PVVAPTSEST VRRTRILGGL INEYRNAA
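
Both sequences are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any downstream use. The gene begins with TTG, which is an alternative start codon under translation table 11, so the protein still begins with M. The following is a minimal sketch of stripping the display spacing and computing a GC fraction. The helper names are hypothetical, and the example runs on only the first two blocks of the gene sequence, so its result is not expected to match the 68% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence, used as a small example.
fragment = clean_sequence("TTGTCAGCGG GGCGGGACCC")
print(len(fragment), gc_fraction(fragment))  # 20 0.75
```

Applied to the full gene sequence, `clean_sequence` should yield a string whose length matches the Gene Length field.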