Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5607 |
Symbol | |
ID | 5673935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6802712 |
End bp | 6803653 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244461 |
Product | hypothetical protein |
Protein accession | YP_001509865 |
Protein GI | 158317357 |
COG category | [S] Function unknown |
COG ID | [COG3786] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.133925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000118142 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGACAC ACCGTAGGTC CGGGCGGTCC GAGGGCCGCC GACTCAGGGG CGCACGCGGC GACCGCGCCA CCGCGCAGAC CGTCGCGATC GGGATGGCCG CGGTCGGCGC GACACTGGTG GTCGGCGCCG CCGGAGCACT GCACGTGACC GGCTCGGTGG AGGATCCGGA CATGTCGACG ACGGCGCAGG CAGGCGACGG GGCCGTTCCC ACCACTTCGG CGCGGCCCGA CGTGGCCACG CCGCACCGCA TCACCGACTC CACCGGCAAC CCCGATCCCG CGGCGGCCGA CGGTCGCGCG GCCGCGGCCC CGGCCCCGGG CGCGGAGCAG TCCCGCGGCG CGATCCCCGG CATCGGCGAC CTCATGATGT CGCGGATCCC GTCGTCGACC CGGCAGGTCA TCGTCGTGAC CGGCGCAGAC CAGTCCTCAA CCGTGAACCG CGTCGTGCTC TGGCAGCGCG CGGACGCCGA CGCCCCCTGG ACAGCCGTCG GCGCGGAGAT CCCCGGCCGC AACGGAGCCA ACGGCTGGAC CCACGACCAC GTCGAGGGCG ATCTGCGTAG CCCCGTGGGC GCGTTCAGCC TCACCGCCGC CGGGGGCCGC TACGCCGACC CGGGCACCGC GCTTCCCTAC GAGTACCGCC CGTCCTTCTA TCAGGCCGGC GGCTACGAGG GCGACCCGAT GGGGGAGGCC TTCAACTACG TGGTCGCGAT CGACTACAAC CGGTTGCCGG GGCATCCGCC GTCCGACCCG ACCCGGCCGC TGGGCGCCGC GGCCGGCGGC GACATCTGGC TGCACGTGGA CCACAACACA CCCACCCGCG GCTGCGTCAG CCTCCCGCAG GCCTCGATGG AGACCGTCCT GCACTGGCTG GCCCCGAGCA GCCATCCCAT GATCATCATG GGTGACCGGG CAAGCCTGGA GGCGTCCGCC GGCCAGCAGT GA
|
Protein sequence | MATHRRSGRS EGRRLRGARG DRATAQTVAI GMAAVGATLV VGAAGALHVT GSVEDPDMST TAQAGDGAVP TTSARPDVAT PHRITDSTGN PDPAAADGRA AAAPAPGAEQ SRGAIPGIGD LMMSRIPSST RQVIVVTGAD QSSTVNRVVL WQRADADAPW TAVGAEIPGR NGANGWTHDH VEGDLRSPVG AFSLTAAGGR YADPGTALPY EYRPSFYQAG GYEGDPMGEA FNYVVAIDYN RLPGHPPSDP TRPLGAAAGG DIWLHVDHNT PTRGCVSLPQ ASMETVLHWL APSSHPMIIM GDRASLEASA GQQ
|
| |