Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3264 |
Symbol | |
ID | 5671638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3864152 |
End bp | 3865222 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242156 |
Product | integrase catalytic region |
Protein accession | YP_001507576 |
Protein GI | 158315068 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCATGG TGTGGTCGCT GCTCTACGCC CTGACACGCA ACGCTCTCGG ACTGATGCTG CTCCGCGTGC GCGGCGACAC CGCGAAAGAG GCGGAGCTCC TCGTCCTGCG ACATCAGGTG GCAGTGTTAC GACGGCAGGT GAACCGCCCG ACGCTGGAAC CGGCGGATCG CGTCATCCTC GCAGCCCTGT CCCGGCTGCT ACCCCGGGCC CGCTGGGGTT CGTTCGTCGT CACCCCGGCC ACCGTGCTGC GCTGGCACCG GGAACTCCTC GCACGACAAT GGACCTACCC GCGCAGGACA CCCGGGCGGC CACCGATCCG CCGGGAGATC CGCGAGCTGG TCCTGCGCCT CGCGCAGGAA AACCCGACCT GGGGCCACCG CCGGATCCAA GGCGAACTCG CCGGGCTGGG CTACCCGGTC GGGGTCGCCA CCGTCTGGCG GATCCTGCAC CACGCCGGCG TCGACCCCGC ACCCCGACAG GCCGACACCT CCTGGCGCAC GTTCCTGCCC GCCCAGGCCT CCGGCCTGCT GGCCTGCGAT TTCTTCACGG TGGACACCGC GTTCCTCCAG CGGATCTACG TGTTCTTCGT TGTCGAACAC GCCACCCGCC ACATTCATGT TCTCGGGGCC ACGAAGCACC CGACCACGGC GTGGGTCACC CAGCAGGCAC GGAACCTGCT GATGGACCTC GACGAACGTG GCCACCGGTT CCGGTTCCTC ATCCGTGACC GCGACACGAA GTTCACGGCT TCCTTCGACG CTGTCTTCGC CGGGGCCGGT ATCGACGTGG TGCGCACACC GCCACAGTCG CCACAGGCGA ACGCGATCGC GGAACGCTGG GTCGGCACCG CCCGCCGGGA ATGCACCGAC AGGCTGCTAA TCGTCTCCGA ACAGCACCTG ACATCGGTCC TCGACAGCTA CGCGAAGCAT TTCAACCCCC ACCGGCCCCA CCGCTCCCTC AGCCAGCACC CACCCGACTC GCCACCCGTG GTCGCCCCGA CGCCGGACTC TACCATCCGT CGCACACGCA TCCTCGGCGG CATGATCAAC GAATATCGCA ACGCCGCCTG A
|
Protein sequence | MVMVWSLLYA LTRNALGLML LRVRGDTAKE AELLVLRHQV AVLRRQVNRP TLEPADRVIL AALSRLLPRA RWGSFVVTPA TVLRWHRELL ARQWTYPRRT PGRPPIRREI RELVLRLAQE NPTWGHRRIQ GELAGLGYPV GVATVWRILH HAGVDPAPRQ ADTSWRTFLP AQASGLLACD FFTVDTAFLQ RIYVFFVVEH ATRHIHVLGA TKHPTTAWVT QQARNLLMDL DERGHRFRFL IRDRDTKFTA SFDAVFAGAG IDVVRTPPQS PQANAIAERW VGTARRECTD RLLIVSEQHL TSVLDSYAKH FNPHRPHRSL SQHPPDSPPV VAPTPDSTIR RTRILGGMIN EYRNAA
|
| |