Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4874 |
Symbol | |
ID | 5673214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5847139 |
End bp | 5848497 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641243729 |
Product | hypothetical protein |
Protein accession | YP_001509145 |
Protein GI | 158316637 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.153139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATCGCC GCATCTTCGG GCTGGAGAAC GAGTACGGCG TCACCTGCGT GTTCCGCGGG CAGCGAAGGC TGTCGCCGGA CGAGGTGGCC CGTTACCTGT TCCGACGCGT CGTGTCCTGG GGCCGTAGCA GCAACGTGTT CCTCAAGAAC GGCGCCCGGC TCTACCTGGA CGTCGGCAGC CACCCGGAGT ACGCCACCCC CGAGTGCGAC TCCGTGCCCG ATCTGGTGAC GCACGACAAG GCGGGCGAGC GGATCCTCGA GGGCCTGCTC GTCGAGGCGG AGCGCCGGCT GCGCGAAGAG GGCATCGCCG GGGACATCCA CCTGTTCAAG AACAACACCG ACTCGGCGGG GAACAGCTAC GGCTGCCACG AGAACTACCT CGTGGGGCGG CACGGCGAGT TCAGCCGCCT GGCGGACGTG CTGGTCCCGT TCCTGGTCAG CCGGCAGATC CTGTGCGGCG CGGGCAAGGT CCTTCAGACC CCGCGCGGGG CGGTGTACTG CATCTCCCAG CGCGCCGAGC ACATCTGGGA GTCGGTGTCG TCGGCGACGA CCCGCTCGCG TCCGATCATC AACACCCGCG ACGAGCCGCA CGCGGACGCC GAGCGCTTCC GTCGGCTGCA CGTCATCGTC GGTGACTCGA ACATGAGCGA GACGACGATG CTGCTCAAGC TGGGCTCCAC CGACCTGGTG CTCCGCATGA TCGAGGCGGG GGTCGTCCTG CGCGACATGT CGCTGGAGAA CCCGATCCGG GCGATCCGCG AGGTGTCCCA CGACATGACC TGCCAGCGCC GGATCAAGCT GGCGAACGGC CGTGAGGTGA GCGCGCTGGA CATCCAGCGC GAGTACTACT CGAAGGCGGT CGAGTTCGTC GAGCGGCGCG GCGGGGACGC CGTGGCGAAG CGGGTGCTCG ACCTGTGGGG CCGCACCCTG CTCGCGATCG AGACCGAGGA CCTGGAGCTG GTCGCCCGGG AGATCGACTG GGTCACCAAG TTCGTCCTCA TCGACCGCTT CCGGCTCAAG CACGGGCTGT CGCTGGCGTC GCCGCGGGTG GCCGAGCTCG ACCTGAAGTA CCACGACATC CACCGTGACC GCGGGCTCTA CTACCGGATG GAGCGGGCCG GCCTGGTCGA GCGGGTCACC CGCGACCTCG ACATCTTCGA GGCGAAGTCG GTGCCGCCGC AGACCACCCG GGCCCGGCTG CGCGGGGAGT TCATCAAGCG GGCGCAGGAG AAGCGGCGCG ACTTCACCGT CGACTGGGTG CACCTCAAGC TCAACGACCA GGCCCAGCGC ACGGTGCTGT GCAAGGACCC GTTCCGGTCG GTGGACGACC GCGTCGACAA GCTGATCGCG AGCATGTAG
|
Protein sequence | MDRRIFGLEN EYGVTCVFRG QRRLSPDEVA RYLFRRVVSW GRSSNVFLKN GARLYLDVGS HPEYATPECD SVPDLVTHDK AGERILEGLL VEAERRLREE GIAGDIHLFK NNTDSAGNSY GCHENYLVGR HGEFSRLADV LVPFLVSRQI LCGAGKVLQT PRGAVYCISQ RAEHIWESVS SATTRSRPII NTRDEPHADA ERFRRLHVIV GDSNMSETTM LLKLGSTDLV LRMIEAGVVL RDMSLENPIR AIREVSHDMT CQRRIKLANG REVSALDIQR EYYSKAVEFV ERRGGDAVAK RVLDLWGRTL LAIETEDLEL VAREIDWVTK FVLIDRFRLK HGLSLASPRV AELDLKYHDI HRDRGLYYRM ERAGLVERVT RDLDIFEAKS VPPQTTRARL RGEFIKRAQE KRRDFTVDWV HLKLNDQAQR TVLCKDPFRS VDDRVDKLIA SM
|
| |