Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3414 |
Symbol | |
ID | 5671785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4044349 |
End bp | 4045455 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242302 |
Product | integrase catalytic region |
Protein accession | YP_001507722 |
Protein GI | 158315214 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCTGCCG GACTGGGCCT GTGTCGGGCA GGCTGGATGG TCATGGTGTG GTCGCTGCTC TACGCCCTGA CACGCAACGC TCTCGGACTG ATGTTGCTCC ACGTGCGCGG CGACACCGCG AAAGACGTCG AGCTCCTCGT CCTACGACAT CAGGTGGCGG TGTTACGACG GCAGGTGAAC CGTCCGACGC TGGAACCGGC GGATCGGGTG ATCCTCGCGG CGCTGTCCCG GCTGCTGCCC CGGGCTCGCT GGGGTTCGTT CTTCGTCACC CCGGCCACCG TGTTGCGCTG GCACCGGGAA TTCCTCGCAC GAAAATGGAC CTATCCCCGC AAGACACCCG GGCGGCCGCC GGTCCGCAGG GAGATCCGCG AGCTGGTCCT GCGCCTCGCG CAGGAAAATC CGACCTGGGG CCACCGCCGG ATCCAAGGCG AACTCGTCGG GCTGGGCTAC CCGGTCGGGG TCGCCACCGT CTGGCGGATC CTGCACCGCG CCGGCATCGA CCCCGCGCCC CGGCGGGCCG ACACCTCTTG GCGTACGTTC CTGCGCGCCC AGGCCTCTGG CCTGCTGGCC TGCGACTTCT TCACGGTGGA CACCGTGTTC CTCCAGCGGA TCTACGTGTT CTTCGTCGTC GAACACGCCA CCCGCCGTGT TCACGTCCTC GGGGCCACGA AGCACCCGAC CTCGGCGTGG GTCACCCAGC GGGCACGGAA CCTGCTGATG GATCTCGACG AGCGCAGCCA CCGCTTCCGA TTCCTGATCC GTGACCGCGA CACGAAGTTC ACGGCTTCCT TCGACGCTGT CTTCGCTGGT GCCGGCATCG ACGTGGTACG CACACCACCG CAAGCCCCGA CGGCGAACGC GATCGCGGAA CGCTGGGTCG GCACCGCCCG CCGGGAATGC ACCGACAGAT TATTGATCGT CTCCGAACGG CACCTGACGT CAGTCCTCGG CAGCTACGCC GAGCATTTCA ACACCCACCG ACCCCACCGC TCCCTCGGCC AGCACCCACC CGACCCGCCG CCCATGGTCA CCCCGACCTC GGATTCCACC GTCCGTCGCA CCCGCATCCT CGGCGGGCTG ATCAACGAGT ACCGCAACGC CGCCTGA
|
Protein sequence | MPAGLGLCRA GWMVMVWSLL YALTRNALGL MLLHVRGDTA KDVELLVLRH QVAVLRRQVN RPTLEPADRV ILAALSRLLP RARWGSFFVT PATVLRWHRE FLARKWTYPR KTPGRPPVRR EIRELVLRLA QENPTWGHRR IQGELVGLGY PVGVATVWRI LHRAGIDPAP RRADTSWRTF LRAQASGLLA CDFFTVDTVF LQRIYVFFVV EHATRRVHVL GATKHPTSAW VTQRARNLLM DLDERSHRFR FLIRDRDTKF TASFDAVFAG AGIDVVRTPP QAPTANAIAE RWVGTARREC TDRLLIVSER HLTSVLGSYA EHFNTHRPHR SLGQHPPDPP PMVTPTSDST VRRTRILGGL INEYRNAA
|
| |