Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3856 |
Symbol | |
ID | 5672219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4583676 |
End bp | 4584665 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641242734 |
Product | hypothetical protein |
Protein accession | YP_001508154 |
Protein GI | 158315646 |
COG category | [S] Function unknown |
COG ID | [COG0392] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.213587 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGTTG AGGCGCCGGC CGAGGCCGTA GGGCCCGAGC CCGCGTGCGG CCGGCGGCCG CCGGCGGGCT CGGCGAGCCA CCGGTGGCGG ACGGCCCTCG CCTGCGTCGT CCCGCTGCTC GCGGTGCTCT GGCTGCTCCA CAGCTGGTCG ACGGTCACGG CGAGCGTCCG CCACCTGGCG GCCGCGCACC CGGGCTGGCT CGCCGTCGCG GTGCTCGCCG CCCTGCCGAC CTGGATCGCG GGAGCCGCCA GCCTGCAGGG GGCGGTCGCC CGGCGGCTGC CCGTCGGGCC GATGCTGGCC GTGCAGGTGG CGGGCAGCGT CGCCAACCAC GTGCTGCCCG CCGGCTTCGG CGTCGGGGCG GTGAAGCTGC GCTTCCTCAA CCGGCACGGC GTCCCTCTGC GCGAGGCGGT GGCCGCGGTC GGACTCGACG CCACCGCCGG AATGATCACC CATGTCGCGG TTCTCGTCGC GCTGCTCTCC GGCGGGTTCC TGCACATGAG CGGGCCACCT GGTGGGCTGA TCGCCACGGT GGCCGCCGTG GTGGCCGGGC TGCTCGCGGT CGGCTGGGCC CTGCCGCCGA TCCGGCGGGC GTGCCGGCGC GGCTGGACGC ACGTGACGGC CCAGGCCCGG ATGCTGGCGG AGATCCTGCG GGTGCCGAGC CGGGCGGTGA TGCTCTGGGG CGGCTCCACC GCGATCCCGC TCCTGCACGC GGCGACCCTG CTGTTCGTGG TGCGGGCGCT ACATCTCCCG CTGGGCGCCG GCGCGGTGTT CGCCATCTAC TACCTGGCCA GCAGCGCGTC CGCGCTCATC CCGTCGCCGG GCGGTTTCGG CTCGCTGGAC GCGGCGCTGA CCGCGGCGAT CGTCGCGGCG GGGCAGTCAC CCACCTCCGC CCTCGCCGCC GTCCTCGGGT ACCGGCTGAT CACGGTGTGG ATCCCGCTCG CGCCCTCGGC CTGCGTCATG GCGGCGCTGG TACGGCGGGG TCACCTCTGA
|
Protein sequence | MVVEAPAEAV GPEPACGRRP PAGSASHRWR TALACVVPLL AVLWLLHSWS TVTASVRHLA AAHPGWLAVA VLAALPTWIA GAASLQGAVA RRLPVGPMLA VQVAGSVANH VLPAGFGVGA VKLRFLNRHG VPLREAVAAV GLDATAGMIT HVAVLVALLS GGFLHMSGPP GGLIATVAAV VAGLLAVGWA LPPIRRACRR GWTHVTAQAR MLAEILRVPS RAVMLWGGST AIPLLHAATL LFVVRALHLP LGAGAVFAIY YLASSASALI PSPGGFGSLD AALTAAIVAA GQSPTSALAA VLGYRLITVW IPLAPSACVM AALVRRGHL
|
| |