Gene Information |
Locus tag | Franean1_2802 |
Symbol | |
ID | 5671191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3316962 |
End bp | 3318575 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641241711 |
Product | hypothetical protein |
Protein accession | YP_001507131 |
Protein GI | 158314623 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
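The length fields in the table above follow from the coordinates. The sketch below shows the arithmetic; it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The last codon is the stop codon, so it adds no amino acid.
    return gene_length, codons - 1


# Values copied from the Start bp and End bp rows above.
print(cds_lengths(3316962, 3318575))  # (1614, 537)
```

The result agrees with the Gene Length (1614 bp) and Protein Length (537 aa) rows.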
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGC CCAGCCAAAG GGTTGAGCAG GACGAGTTGC GGACTCGGAT GCGCGCGACC GGCATGAGCC ACCACGAGAT CGCGATCGAG TTCGCCCGCC GCTACCGGCT ACGCCCCCGC GCTGCCTACC GTGTCGCCCA CGGCTGGACA CAGCAGCAGG CCGCCGACCG CATCAACGCC CACGCCGTCC GTGCCGGCCT CGACCCGGAC GGCACCGCCC CGATGACCGC GCCGCGGCTA TCGGAGGTGG AGAACTGGCC TCGCCCTGCC CGGCGACGTC CCACCCCGCA GATCCTCGCC CTGCTTGCCG AGGTGTACGG ATGCGATCTC CACGCCCTCG TCGACGTGGA CGATCGCGAA CACCTCCCTC CGGCAGACGT GTTCCTGATC AACGGCATGC GCCGGCTGCC GGACGGCGTG GCTGCGTCAT CGACGGCCTC TCCGATCACA CTCGCGGGGA AACGATGGGG AACGACCACC GAACGACCGG TTGACGTCGC CTTCGGCGCC GCCAGAGCTG GGCAAGCACC GGCCCCCAAC TCGGTCGGTC TCGCAGACAA AGACAACGTG ATCATTTTTC CGCAACTCGC CCCGGATGGG AGGATCGTCC TCATGCCACT CGATCGCCGG GGTTTCCTGA GCGGCCTGGG CCTCACCGCC GCCAGCAGCG CAGCCCTCAG CCCGCTCGCC ACGATGCCAC CAGGATCGTC CTCCATCGAC CCACGTGTCG TCGATCACTT CGCACGCCTG CGGGCCGTGC TCGCGGAGAA CGACAACCTC TTCGGGCCGC GCCAGGTCAT CACCACAGCG CAGGAACAGG CCGGCCTCAT CGCCGCCCAT CTCCGCCACG GCACGAGTTC GCGTCCACAA CGGCAGACCC TGCTCCACAT CCAAACGCAG TTCGCTGATC TCCTCGGCTG GCTCCATCAA GACAGCGGCG ATAACGCCAC CGCTGGATAC TGGCTCGACC GAGCGCTGGA ATGGTCACAC CGAGCGAGCG ACCCCAACGC CACCGTGTTC ATCCTTGCCC GCAAAAGCCA ACTCGCCGCA GACCGTGGCG ACCCAGCTGA AGCCGTCGAC GTCGCCGACG CAGCCCTGAC CAGCGCGGAG CCAACCGGTC GTCTCGCCGC CATCGCCGCG ACCTACAGCG CGCATGGCCA CGCACTACGC GGCGAGAAAA CCACCTGCCT GACGCTCTAC GACCGCGCCC ACGACATCCT CGACCAGGCC GGACCCGACA CCGACCCCTG GGGCGAGTTC TTCAGCCCTG CCTACATCGA AGTCCAGCGG GCACACAGCC TCGCCGCACT CGGTGACTAC CCTGCTGCAG CCACCGGGTT CCGGACCGCG ATCGACGGTC TCCCCTCGGC TTTCCACCGC GACCGCGGTG TCTACCTCGC CCGCGAAGCC CTCGCACACG CAGGAGCACG CGAACCAGAG CAAGCCGCGA CACTCGGTCT CAACGCACTC ACGGTCGGCG CCAGCACGCA CTCCGGATGC ATCATGACCA GTCTGCGGTC CCTGCGTGAC GCCGTCGCCG GATGGCAAAC TGTCTCCCAG GTACGCGAGT TCCGCCAGGC GATGGACCAG GTCCCCACGG CCATCACCGT CTGA
Protein sequence | MNKPSQRVEQ DELRTRMRAT GMSHHEIAIE FARRYRLRPR AAYRVAHGWT QQQAADRINA HAVRAGLDPD GTAPMTAPRL SEVENWPRPA RRRPTPQILA LLAEVYGCDL HALVDVDDRE HLPPADVFLI NGMRRLPDGV AASSTASPIT LAGKRWGTTT ERPVDVAFGA ARAGQAPAPN SVGLADKDNV IIFPQLAPDG RIVLMPLDRR GFLSGLGLTA ASSAALSPLA TMPPGSSSID PRVVDHFARL RAVLAENDNL FGPRQVITTA QEQAGLIAAH LRHGTSSRPQ RQTLLHIQTQ FADLLGWLHQ DSGDNATAGY WLDRALEWSH RASDPNATVF ILARKSQLAA DRGDPAEAVD VADAALTSAE PTGRLAAIAA TYSAHGHALR GEKTTCLTLY DRAHDILDQA GPDTDPWGEF FSPAYIEVQR AHSLAALGDY PAAATGFRTA IDGLPSAFHR DRGVYLAREA LAHAGAREPE QAATLGLNAL TVGASTHSGC IMTSLRSLRD AVAGWQTVSQ VREFRQAMDQ VPTAITV
| |
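The sequences above are printed in blocks of ten characters separated by spaces. As a minimal sketch, the Python below strips that spacing, computes GC content, and translates a coding sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is written out by hand rather than taken from a library; translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three blocks of the gene sequence above.
print(translate("ATGAACAAGC CCAGCCAAAG GGTTGAGCAG"))  # MNKPSQRVEQ
```

The ten residues printed match the first block of the protein sequence above. Passing the full gene sequence to `gc_percent` and `translate` would be one way to cross-check the GC content and Protein sequence rows.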