Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1066 |
Symbol | |
ID | 4268987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1245037 |
End bp | 1246428 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638125817 |
Product | hypothetical protein |
Protein accession | YP_741908 |
Protein GI | 114320225 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCA CCATCCGCGA CGTGATGACC GATCCCGCCC TGTTCGGTGG CCAGTTCGGT GGCGACACCT GGGCCGCCTG GCGTGCGCTC CTGAGCGGCT TTTATGGCCT CCCGCTGGAC GATGCCGAGG CACAGCACTG GCACGCGCTC ACAGACCGCG AGAGCGCCCC GCAGAGCGCA CATGACGAGT TGTGGCTAGT GGTAGGCCGC CGCGGTGGCA AGTCCAATGC AGCGGCCTTG CTGGCGGTCT ATGAGGCGTG TTTCAAAGAC CACCGCGATG CCCTGGCACC CGGTGAGGTT GCCACCACCC GCGTCATGGC TGCCGACCGT GCGCAGGCCC GCAGCGTGTT CCGGTATATC TCCGGTCTGA TGCACGCGAA TCCGATGCTG GAACGGCTGA TCGTGCGCGA GGATCGGGAA TCCATAGAGC TGTCCAACCG GGCTGTTATC GAGGTGGGCA CGGCCTCATT CCGCACGACA CGCGGCTACA CGTTCGCGGC GGTGATTGCC GACGAGGTGG CGTTCTGGCG CTCCGATGAC AGCGCGAACC CTGACAGCGA GATCATTGCC GCCGTGCGTC CCGGTCTGGC CACGCTGAAC GGCAAGCTGA TCGCGCTTTC CAGCCCATAC GCCCGACGCG GTGAGCTATG GGAAAACTAC CGCCGACACT ACGGCAAGGC ATCGCCCATC CTGGTGGCGC AGGCTCCCAG CCGCACCATG AATCCCTCAT TGCCTGAGCG CGTGGTCACG GAGGCAATGG AGCGTGACCC GGCCAGTGCG GCGGCGGAGT ACCTGGCGGA GTTCAGGACG GACGTGGAGA CCTTCCTGCA ACGCGAAGTA GTAGAGGCCG CCACGCGGCC CACCCCGCTG GAGTTGCCCT ACAACAAGCG CGTTACCTAT ACCGCCTTTG TTGATCCGGC AGGTGGTGGC GCGGATGAGT TCACCGCCGC CATCGGCCAC CGGGAAGGGG AGCGCGTGGT CGTGGACGTG CTACGCGCCC GCAAGGGTAC GCCTGCCGAG ATCGTTGCCG AATACGCCGA CCTGCTGAAG TCCTACCGGA TCACCCGCGC TATCTCGGAT CGTTATGCAG GCTCATGGCC TGCCGACGAG TTCAGCCGCC ACGGGATCAC CGTAGAGCAG GCCGCTAAAC CGAAGTCAGA CCTTTATCGG GACATGCTCG CCAGCATGAA CAGCGCCCGC GTGGAGCTTC CGCCCGATGA TCGGCTGATG ACCCAGCTAA TCAGCCTGGA GCGCCGCACA GCACGCGGTG GCCGGGACAG TATCGACCAC GCCCCCGGTG GTCACGATGA CAGAGCAAAC GCCGTTGCCG GTCTGGTGGC GGCCAACTCA CGCGCCCCAG GCGAACGGAT GCGGGCGCTT TGCACTTGGT AG
|
Protein sequence | MSVTIRDVMT DPALFGGQFG GDTWAAWRAL LSGFYGLPLD DAEAQHWHAL TDRESAPQSA HDELWLVVGR RGGKSNAAAL LAVYEACFKD HRDALAPGEV ATTRVMAADR AQARSVFRYI SGLMHANPML ERLIVREDRE SIELSNRAVI EVGTASFRTT RGYTFAAVIA DEVAFWRSDD SANPDSEIIA AVRPGLATLN GKLIALSSPY ARRGELWENY RRHYGKASPI LVAQAPSRTM NPSLPERVVT EAMERDPASA AAEYLAEFRT DVETFLQREV VEAATRPTPL ELPYNKRVTY TAFVDPAGGG ADEFTAAIGH REGERVVVDV LRARKGTPAE IVAEYADLLK SYRITRAISD RYAGSWPADE FSRHGITVEQ AAKPKSDLYR DMLASMNSAR VELPPDDRLM TQLISLERRT ARGGRDSIDH APGGHDDRAN AVAGLVAANS RAPGERMRAL CTW
|
| |