Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4197 |
Symbol | |
ID | 5672552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4996976 |
End bp | 4998307 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243070 |
Product | hypothetical protein |
Protein accession | YP_001508487 |
Protein GI | 158315979 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.017102 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.75785 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTCA ACCGCCGTCA GGTGGTCGTG GGAACCGGCG CGGCCGGCCT GGGCTTCACC CTGTCGGGTG CGGTGAGCTC GGTGTTCGCC GGCACCGCGT CCGCCGCAAC ACCGAAGAAG TTCGCCGGGT ACGGCGAGCT GGTCCCGGAT CCGAAGGGCC TGGTCGATCT TCCCTCCGGC TTCCGGTACA CCGTGCTGTC CCGAGCCGGG GTCGACTCCC TGACCGGCGG GGGTGTGGTG CCCGGCGCGC CCGACGGGAC GTACGCGTTC CCACTCGGCC CGGGCCGCAG CGTTCTCGTC CGCAACCACG AGCTCTCCCC CGGAGGCACG GACCTCGTCC CGCAGCGGCC GGGCATCACC TACGACCCGG CCGCCCCGGG CGGGACGACG ACCGTCACCG TCGCCGGTGA CCGGCTGCTG TCGGCCGTGC CCAGCCTGGC CGGCACCATC CGCAACTGCG CCGGCGGCAA CACCCCGTGG CGCACCTGGC TGTCCTGCGA GGAGACCGAG GACACCCCGG CGACCAACCC CGCGCTCACC AAGCGGCACG GCTACGTCTT CGAGGTCGAC CCGTTCGGCC GGCTGCGTGA CCGCGAGGCG GTGCCGCTGA CCGCGCTCGG CCGGTTCGCG CACGAGGCCG TCGCGGTCGA CCCCCAGTCG GGCTGCCTCT ACCTGACCGA GGACGCGTCC AAGCCGTACG GCCTGATCTA CCGCTTCCTG CCCCGCCGGC CGCTCGGCGG GCCCGGCAGC CTGCGGGCCG GCGGCAAGCT GCAGGCGCTG CAGGTCCCCG GTGTCCCGGA CCTGTCCGCC ATCAGCGAGC TGCACACCAC GGTGCAACTG TCCTGGGTCG ACGTCCCCGA CCCGGACGCC GCCACGGTGT CGACCCGCAA GCAGTTCGCC GCCGGCAAGG TCACCCAGGT CCCGAAGGCC GAGGGCATCT TCTGGTCCGG GCGCTCCGCC TACGTGGTCT CCAGCTACGC CAAGACCGCG GACGGCGCCG CCCGTGACCA CGCCGGCCAG GTCTGGAAGC TCGACCCGAA GAAGGGCACC CTCGAGCTGG TGCTGCTGAT CGAGCCGGGC GGCCGGTTCG ACGGCCCCGA CAACATCACC GTGTCGCCGG GCGGCGGCAT CGTGCTGTGC GAGGACGGCG ACGGCGAGCA GCACCTGATC GCGCCGTCGG CCGAGGGTGT CCCCTACCCG CTGGCCCGCA ACGCCACCAG CGAGAGCGAG TGGGCCGGCG CCACGTTCTC CGCGGACGGC CGCTGGCTCT ACGCCAACAT CCAGAGCGAC GGCCTCACGG TGGCGATCAC CGGCCCGTGG TGGCGCGGCT GA
|
Protein sequence | MAVNRRQVVV GTGAAGLGFT LSGAVSSVFA GTASAATPKK FAGYGELVPD PKGLVDLPSG FRYTVLSRAG VDSLTGGGVV PGAPDGTYAF PLGPGRSVLV RNHELSPGGT DLVPQRPGIT YDPAAPGGTT TVTVAGDRLL SAVPSLAGTI RNCAGGNTPW RTWLSCEETE DTPATNPALT KRHGYVFEVD PFGRLRDREA VPLTALGRFA HEAVAVDPQS GCLYLTEDAS KPYGLIYRFL PRRPLGGPGS LRAGGKLQAL QVPGVPDLSA ISELHTTVQL SWVDVPDPDA ATVSTRKQFA AGKVTQVPKA EGIFWSGRSA YVVSSYAKTA DGAARDHAGQ VWKLDPKKGT LELVLLIEPG GRFDGPDNIT VSPGGGIVLC EDGDGEQHLI APSAEGVPYP LARNATSESE WAGATFSADG RWLYANIQSD GLTVAITGPW WRG
|
| |