Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3926 |
Symbol | |
ID | 5672287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4694974 |
End bp | 4696197 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242805 |
Product | hypothetical protein |
Protein accession | YP_001508222 |
Protein GI | 158315714 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.449305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0394993 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCCCG TCCACGAGCT GGCGGGAGCG ACCGCGCCGA TCTGGATGTG GACACCTCCC CAGGAGGTCG AAAGCGCCGC GCTCGACCAG CTCCGCACCA TCGCGGAACT CCCCTACGTC CACCACCACG TCGCCGTCAT GCCCGACGTC CACCTCGGCA AGGGCGCTAC GGTGGGTTCG GTGATCGCGC TGCGCGACGC CGTCTCCCCG GCTGCCGTCG GCGTCGACAT CGGATGCGGG ATGGCCGCGC TGCGGACCAA CCTGACCGCC GCCGACCTGC CCGACGACCT CGGGCCGGTG CGGGCCGCGG TCGAGGCGGC CATCCCCGTC GGCCATCGCG GGCACACCGA GATCTCCCGC CGGGTCCGCG GCTACGCCGA CCTGTGGGGG ACCTTCGGCG ACCTACACCC CAAGGTCACC GGGCGCAGCG GCAGCATCGA CCGGGTGATG GCGCAGATGG GCAGCCTGGG CTCGGGCAAC CACTTCGTCG AGCTCTGCCT GGACACCGAG GACGCCGTCT GGCTCATGCT GCACTCGGGC TCGCGCAACA TCGGCAAGAC CCTCGCCGAG CTGCACATCG CGGCCGCGAA GAAGCTCCCG CACAACACCG GCCTGCGCGA CCGGGACCTC TCCGTCTTCC TCGCCGGCAC CCCCGAGATG GCCGCCTACC GAGCCGACCT CACCTGGGCG CAGACCTACG CGCGCCGCAA CCGCGACACC ATGCTCGCTC TCTACGTCGA CGCGCTGCGC GACCACCTGC CCACGCTGCG CGTGCCGACC CCGCCCACGC ACGGGCCGCA CGCCGCCGAC ACCGCCTTCG CCGCGGTCAA CTGCCACCAC AACTACGTCG CCGAGGAGCA CCACTACGGA GCGGACGTCC TGGTCACCCG CAAGGGCGCG ATCTCGGCCC GGGCCGGCGA GTACGGGATC ATCCCCGGCT CCATGGGCAC CCGCTCGTAC ATCGTCCGCG GCCTCGGTAG CGCCGAGTCG TTCCACTCGG CCGCGCACGG CGCCGGGCGG CGGATGAGCC GTACCCGTGC CCGCAAGGAG TTCACCACCG ACGACCTCGT CGCCCAGACC ACCGGCGTCG AGTGCCGCAA GGACCCCGGA GTCCTCGACG AGATCCCCGC TGCCTACAAG GACATCGACG CCGTCATCGC CCACCAGAGC GATCTCGTGG ACGTCGCCGC CGAGCTGCGC GCCGTCCTCT GTGTGAAGGG CTGA
|
Protein sequence | MTPVHELAGA TAPIWMWTPP QEVESAALDQ LRTIAELPYV HHHVAVMPDV HLGKGATVGS VIALRDAVSP AAVGVDIGCG MAALRTNLTA ADLPDDLGPV RAAVEAAIPV GHRGHTEISR RVRGYADLWG TFGDLHPKVT GRSGSIDRVM AQMGSLGSGN HFVELCLDTE DAVWLMLHSG SRNIGKTLAE LHIAAAKKLP HNTGLRDRDL SVFLAGTPEM AAYRADLTWA QTYARRNRDT MLALYVDALR DHLPTLRVPT PPTHGPHAAD TAFAAVNCHH NYVAEEHHYG ADVLVTRKGA ISARAGEYGI IPGSMGTRSY IVRGLGSAES FHSAAHGAGR RMSRTRARKE FTTDDLVAQT TGVECRKDPG VLDEIPAAYK DIDAVIAHQS DLVDVAAELR AVLCVKG
|
| |