Gene Information |
Locus tag | Franean1_2840 |
Symbol | |
ID | 5671229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3356044 |
End bp | 3357555 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241749 |
Product | hypothetical protein |
Protein accession | YP_001507169 |
Protein GI | 158314661 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
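The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is an illustration, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 3356044
END_BP = 3357555
GENE_LENGTH_BP = 1512
PROTEIN_LENGTH_AA = 503


def span_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced CDS whose length is a multiple of 3.
    """
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span 3356044 to 3357555 gives 1512 bp, and 1512 bp corresponds to 504 codons, or 503 residues once the stop codon is excluded, which agrees with the listed fields.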
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.43612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTCGAC ACAGTCTGCG GGTGGAGCAG GATGAGCTGC GGGCCCGGAT GCGCACGGTC GGCATGTCCC ACGACGAGAT CGCGATCGAG TTCGCCCGCC GCTACCACTA CCGCCCCCGC GCCGCCCATC GCCATGCCCG CGGCTGGACC CAGACGCAGG CCGCGAACCA CATCAACGCC CACGCCGCCC GCGCCGGCCT CGACCCGGAC GGCGCTGCCC CCATGACCGG CCCGAAGCTG TCGGAACTGG AGACCTGGCC GCTGCCGAGC AACCGCCGCC GGCCCACCCC CCAGATTCTC GCCCTGCTCG CCGAGGTCTA CGACACCAGC ATCCACAACC TCATCGACCT GGACGACCGC GAACACCTTC CCCCCACCGA CATGCTCCTC ATCAACACCG CGCGGAGGAA CCCTGCGGCG GACCCGGAAC GTGGATCAGC ATCGGCCCCT GTGCTCGCGG ACGACTCAGG CCAGCCTCGC AGAGGCGAAC AGATCACGAT GCCAGTCGTA CCAACAGCGC AGGTTATGAG GCCCGCAGAT GCTACGGGAG CCGCGATGCC CGACCTCGAC CGACGAGACT TCGTCACCGC GACGGCCACT GCGCTCACGT TCGGCAGGCC CGCCCTCCGG AACGTAGACC CGGCCCTCAT CGACTACTTC AACCAGCAGC TGGAAGGCCA CTACCAAGCG GACATGATGC TCGGCCCCCG CGAGCTAATC AGCACCGTCA CCGCCCAGCA CACTCTCATC AGCAATCTGG TGGCGACCGG CCACGGGGGC ACGCGGCGGG CCCTTCTCGG CGCAAGCGCG GCCTACGCCT CACTGATCGG CTGGCTCCAT CAGGACGCCG GTGACCTTCC GAGTTCGTCC GTCTGGCGTG GCATCGCGCT GGAAGCCGCA CAACGCTCCC GGGACCACCA GCTCGTCGCC TACGCGCTGC TCAACCACGC ATCCGTTCGC ACAGATCTTG CCGACGGTCT CGGTGCGCTC GACCTGTGTG GCGCGGTCCT GGCAGACGCC GGCCGGCTCA GCCCGAAGAT GCGGGTCCTA GCGCTCCAGC AACAGGCCCA CGGCGCGTCG CTCATCGGAG ATCGTGCGAC CGTCGACTCC GTCCTGGACC AAGCGGCCCC GCTCGTCGAG AGATGCGACG ACGCCATGCC GTGGGGCAAC GCCTGCCGGC GCACGCCGGC CTACCTGGAG GTGCAACGCG CCACCTGCTA TGGACGCTTG GGACTGGCCA CCGCCGCCAC CGGCCTGTGG CAGCAGGTTC TCGCAACGAC GCCTACCCAT GCCCGGCGGG ACCGCGGCGT GTATCTCGCC CGTCAGGCCA CCGCATGTGT CCAGAACGGA GATCTGGGGC GCGCGGTCGA GGCCGGACGG CTCGCGGCGG ACGTGGCGGT GGAAACTGGG TCTGTGCGGA TTCGCCGCGA GCTCGCCGGC CTACGGCAAG CCGCAGAACC GTGGAAGGGC ACGGCCGTCG GCCGCGACCT CGACGAGATC TTCGCGCTCT GA
|
Protein sequence | MSRHSLRVEQ DELRARMRTV GMSHDEIAIE FARRYHYRPR AAHRHARGWT QTQAANHINA HAARAGLDPD GAAPMTGPKL SELETWPLPS NRRRPTPQIL ALLAEVYDTS IHNLIDLDDR EHLPPTDMLL INTARRNPAA DPERGSASAP VLADDSGQPR RGEQITMPVV PTAQVMRPAD ATGAAMPDLD RRDFVTATAT ALTFGRPALR NVDPALIDYF NQQLEGHYQA DMMLGPRELI STVTAQHTLI SNLVATGHGG TRRALLGASA AYASLIGWLH QDAGDLPSSS VWRGIALEAA QRSRDHQLVA YALLNHASVR TDLADGLGAL DLCGAVLADA GRLSPKMRVL ALQQQAHGAS LIGDRATVDS VLDQAAPLVE RCDDAMPWGN ACRRTPAYLE VQRATCYGRL GLATAATGLW QQVLATTPTH ARRDRGVYLA RQATACVQNG DLGRAVEAGR LAADVAVETG SVRIRRELAG LRQAAEPWKG TAVGRDLDEI FAL
|
| |
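The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as a start codon and is then read as methionine, although the same codon encodes valine inside the reading frame. The sketch below is a minimal illustration, not the tool that produced this record. `translate_cds` and `START_CODONS` are hypothetical names, and the start codon set is limited to ATG, GTG and TTG, which is an assumption covering only the commonly used bacterial starts.

```python
# Codon table in the conventional T, C, A, G ordering.
# Table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    A recognised start codon in the first position is read as M.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate_cds("GTGAGTCGAC ACAGTCTGCG GGTGGAGCAG"))
    # Last 12 bases of the gene sequence, ending in the TGA stop codon.
    print(translate_cds("TTCGCGCTCT GA"))
```

Applied to the first 30 bases, this yields MSRHSLRVEQ, matching the start of the listed protein sequence. The last 12 bases yield FAL followed by a stop, matching its end.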