Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0441 |
Symbol | |
ID | 5675655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 522170 |
End bp | 523276 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641239374 |
Product | integrase catalytic region |
Protein accession | YP_001504812 |
Protein GI | 158312304 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCTGCCG GACAGGGCCC ACATCGGGCA GGCTGGATGG TCATGGTGTG GTCGTTGCTC TACGCCCTGA CACGCAACGC TCTCGGGCTG ATGCTGCTCC GGGTCCGTGG GGACACCGCG AAGGACGTCG AGCTCCTCGT CCTGCGGCAT CAGGTGGCGG TGTTGCGACG GCAGGTGAAC CGCCCGGCCC TGGAACCGGC AGATCGGGTG ATCCTCGCAG CCCTGTCCCG GCTGCTACCC CGGGCCGGCT GGGGTTCGTT CTTCGTCACC CCGGCCACCG TGCTGCGCTG GCACCGTGAG CTCCTCGCGC GAAAATGGAC CTATCCGCGC AAGACCCCTG GGCGGCCGCC GGTCCGCCGG GAGATCCGTG AGCTGGTTCT GCGTCTCGCG CGGGAGAATC CGACCTGGGG CCACCGCAGG ATCCAGGGAG AACTGATCGG GCTGGGCTAC CCGGTCGGGG TCGCCACCGT CTGGCGGATC CTGCACCGCG CTGGTGTCGA CCCCGCGCCG CGGCGGGCTG ACGCCTCTTG GCGTACGTTC CTGTCCGCGC AGGCCTCCGG CCTGCTGGCC TGCGATTTCT TCATGGTGGA CACTGTGTTC CTGCAGCGGA TCTACGTGTT CTTCGTCGTC GAACACGCCA CGCGCCGTGT TCATGTTCTC GGGGTCACGA AGCATCCGAC CTCGGCGTGG GTCACCCAGC GTGCGCGGAA CCTGCTGATG GATCTCGACG AGCGTTGCCA CCGGTTCCGG TTCCTGATCC GTGACCGCGA CATGAAGTTC ACGGCTTCCT TCGACGCTGT CTTCATCGGG GCCGGTATCG ACGTGGTACG CACACCCCCG CAAGCTCCGA AGGCGAACGC GATCGCGGAA CGCTGGGTCG GCACCGCCCG CCGCGAATGC ACCGACAGAC TGCTGATCGT CTCCGAACGA CACCTGACGT CAGTCCTCAC CACCTACGCC GAGCACTTCA ACACCCACCG GCCTCACCGC TCCCTCGGCC AGCACCCACC CGACTCGCCA CCCGTGGTCG CCCCGACGTT GGAGTCCACC GTCCGTCGCA CACGCATCCT CGGCGGCATG ATCAACGAAT ATCGCAACGC CGCCTGA
|
Protein sequence | MPAGQGPHRA GWMVMVWSLL YALTRNALGL MLLRVRGDTA KDVELLVLRH QVAVLRRQVN RPALEPADRV ILAALSRLLP RAGWGSFFVT PATVLRWHRE LLARKWTYPR KTPGRPPVRR EIRELVLRLA RENPTWGHRR IQGELIGLGY PVGVATVWRI LHRAGVDPAP RRADASWRTF LSAQASGLLA CDFFMVDTVF LQRIYVFFVV EHATRRVHVL GVTKHPTSAW VTQRARNLLM DLDERCHRFR FLIRDRDMKF TASFDAVFIG AGIDVVRTPP QAPKANAIAE RWVGTARREC TDRLLIVSER HLTSVLTTYA EHFNTHRPHR SLGQHPPDSP PVVAPTLEST VRRTRILGGM INEYRNAA
|
| |