Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3039 |
Symbol | |
ID | 5671418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3572412 |
End bp | 3573518 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641241937 |
Product | integrase catalytic region |
Protein accession | YP_001507357 |
Protein GI | 158314849 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCTGCCG GACAGGGCCC GCGTCGGGCA GGCTGGATGG TCATGGTGTG GTCCCTGTTC TACGCCCTGA CACGCAACGC TCTCGGAGTG ATGCTGCTCC GAGTCCGCGG GGACACCGCG AAGGACGTGG AGCTCCTCGT CCTGCGACAT CAGGTGGCGG TGTTACGACG GCAGGTGAAC CGCCCGGCGC TGGAACCGGC GGATCGGGTG ATCCTCGCAG CCCTGTCCCG GCTGTTGCCC CGGGCCCGCT GGGGTTCGTT CGTCGTCACC CCGGCCACCG TATTGCGCTG GCACCGTGAC CTCCTCGCAC GACAATGGAC CTACCCTCGG ACGTCGCCCG GACGGCCATC GGTCCGCCGG GAGATCCGCG AGCTGGTCCT GCGCCTCGCA CGGGAGAACC CGACCTGGGG CCACCGCCGG ATCCAAGGAG AACTCGTCGG GTTGGGCTAC CCGGTCGGGG TCGCCACCGT CTGGCGGATC CTGCACCGCG CCGGTGTCGA TCCAGCGCCC CGTCGGGCCG ACGCCTCCTG GCGCACGTTC CTGCGCGCCC AGGCCTCCGG CATCCTCGCC TGCGATTTCT TCACCGTGGA CACCGTATTC CTACAACGGA TCTACGTGTT CTTCGTCGTC GAGCACGCCA CCCGCCGTGT CCACGTCCTC GGGGTCACGA AGCATCCAAC CGCGGCCTGG GTCACCCAGC AGGCACGGAA CCTGCTGATG GACCTCGAGG AACGTGGCCA CCGGTTCCGG TTCCTCCTCC GTGACCGCGA CACGAAATTT ACGGTTTCCT TCGACGCTGT CTTTGCCGGA GCCGGTATCG ACGTGGTGCG CACACCGCCA CAGTCGCCGC AGGCGAACGC GACCGCGGAA CGCTGGGTCG GCACCGCCCG CCGGGAATGC ACCGACAGGC TGTTGATCGT CTCCGAACGG CACCTGACCA CCGCCCTCAC CACATACGCC GAGCATTTCA ACACCCACCG GCCTCACCGC TCCCTCGGCC AGCACCCGCC CGACCCGCCA CCCGTGGTCA CCCCGACCCC GGGTTCCACC GTCCGTCGCA CACGCATCCT CGGCGGGCTG ATCAACGAGT ACCGCAACGC CGCCTGA
|
Protein sequence | MPAGQGPRRA GWMVMVWSLF YALTRNALGV MLLRVRGDTA KDVELLVLRH QVAVLRRQVN RPALEPADRV ILAALSRLLP RARWGSFVVT PATVLRWHRD LLARQWTYPR TSPGRPSVRR EIRELVLRLA RENPTWGHRR IQGELVGLGY PVGVATVWRI LHRAGVDPAP RRADASWRTF LRAQASGILA CDFFTVDTVF LQRIYVFFVV EHATRRVHVL GVTKHPTAAW VTQQARNLLM DLEERGHRFR FLLRDRDTKF TVSFDAVFAG AGIDVVRTPP QSPQANATAE RWVGTARREC TDRLLIVSER HLTTALTTYA EHFNTHRPHR SLGQHPPDPP PVVTPTPGST VRRTRILGGL INEYRNAA
|
| |