Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1678 |
Symbol | |
ID | 5670080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2006581 |
End bp | 2007777 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240596 |
Product | secreted protein |
Protein accession | YP_001506022 |
Protein GI | 158313514 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.785401 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAT TGACCCGCAT CGGCGGCTTC GGTCTCGCGT TGGCCGTGCT CTTCGGTGTC TCCTATGTGG TCGGGACGAC GATCGGGCCG TCCGAGGAGT CCAACGGTGC GGCCGGGACC GGCGCCCATG GCCACGAGGC CTCCGCCGCG CCGGGCGGCG CGCCGGGCGT GACCAGCGGT GATCCCACCG CGGCCGGGCT CGCGATCTCC GCGGGCGGCT ACACGCTCGC GCCCGACACG CTCACCTGGC CCGCCGGCCG CTCGCAGCCC TACACCTTCC GCGTCCTCGG CCCGGACGGC CGCCCACTCA CCGCCTTTAC CCGGGTACAC GACGCAGACC TGCACCTGAT CGTCGTGCGG CGTGACCTCA CCGGCTACCA GCACCTGCAT CCCACCCGTG ACGCCGCCGG CACGTGGAGC GTCCCGCTCC GACTGTCCGC GCCCGGCGTC TACCGCGTGT TCGCCGACTT CGCTCCGACG TCCACGCCGG ACGGCGCCGA CAGCGGTACC GGTACCGGTG GCACCGCGGC CGGCAGCATG GCGGACGGGG CGAGCATTCC GGGAGTCCCG GTCGCTCCGG TCACGCTCGG CACCTTCGAG ATCTCCGGCA TGTCGGAGAT CTCCGGGAAT TCCGAGATCT CCGGGACCTC GGGTAGCTCG CCGGCCACGT CCACGATCAC GCTCGGCACG GACCTCACCG TGGGCGGCGA CCTGCGCCCC GCCACGCTGC CGGCGCCGTC GCGCACCGCG ACAGTGGCGG GTGGGTACAC CGTGACGCTC GGCGGGCAGG CCAGCCCCGA CGGGATGAGT GACCTCGTCT TCTCGGTCAC CCGCGCCGGA CGGCCGGTCA CCGATCTCGA GTCGTACCTG GGCTCGCTCG GTCATCTGGT GGTGATCCGG CAGGGTGATC TCGCCTACCT GCACGTCCAC CCGGGAGAGA GCGGGGGCTC CACCAGCGGC GCCGTGGCGA CGCCCGGAGG TCTCGGCGGC GGCATGAGCG ATCCCACGGC CAGTCCCGGG AGCGGCGATC CCGCGCTGTC GTTCATGACC GAGTTCCCGT CGGCCGGTGC CTACCGCCTC TTCCTGGACT TCAAGCACGC CGGACAGGTG CGAACCGCGG AGTTCACCAT GACGGTCCCG GTGGACGGGG CCGCGCCCAC GCCGACGCGG ACGCCCACCT CCGCGCACGC GCACTGA
|
Protein sequence | MNALTRIGGF GLALAVLFGV SYVVGTTIGP SEESNGAAGT GAHGHEASAA PGGAPGVTSG DPTAAGLAIS AGGYTLAPDT LTWPAGRSQP YTFRVLGPDG RPLTAFTRVH DADLHLIVVR RDLTGYQHLH PTRDAAGTWS VPLRLSAPGV YRVFADFAPT STPDGADSGT GTGGTAAGSM ADGASIPGVP VAPVTLGTFE ISGMSEISGN SEISGTSGSS PATSTITLGT DLTVGGDLRP ATLPAPSRTA TVAGGYTVTL GGQASPDGMS DLVFSVTRAG RPVTDLESYL GSLGHLVVIR QGDLAYLHVH PGESGGSTSG AVATPGGLGG GMSDPTASPG SGDPALSFMT EFPSAGAYRL FLDFKHAGQV RTAEFTMTVP VDGAAPTPTR TPTSAHAH
|
| |