Gene Information |
Locus tag | Mlg_2589 |
Symbol | |
ID | 4270298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2932553 |
End bp | 2933980 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638127348 |
Product | putative nitrate transporter component |
Protein accession | YP_743419 |
Protein GI | 114321736 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
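The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2932553, 2933980)
print(length, protein_length(length))  # 1428 475
```

Both values match the Gene Length and Protein Length fields of this record.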
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.00200575 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAACC GATCACTCAA CGACCCGTTC AACCCGGAGG CGGACCTGCG CCACGGAGCG GGCTGCACCT GCTCCAGCTG CGGTGGCGGC GAACATGCCC AGCACGACCA CGGGGCCGCC CCAGCGGAGC AGGACGCCAA CCAGATGCTG GCCGCGAAGC AGGGCGTCGA CAAAGAGGCC ATGTTCGACC GGGCCGTGGA GAGCGCGGTG GTGCGTGCCC TGTTCGGTCA CCACGACGCC AGCCGGCGTT TCTTCCTGAA GGCGGTGGGC GCCGGCACCT TTGCTGCGGC GGTGGGCTCC ATATTCCCGC TGGATGCGGC CAAGGCCATG CTCAAGGACA ACCTGGGTGA CCCCGAGAAG CGCGACCTGA CCGTGGGCTT TGTGCCCATC ACCTGCGCCA CGCCCATCAT CATGGCCCAC CCCATGGGCT TCTACGAGCG CTACGGGCTG AACGTGGACC TGCGCTCCAC CGCCGGCTGG GCGGTGGCCC GGGATATGTC CATGAACCGG GAGTACGACG CCTCCCACAT GCTCACCCCC ATGCCCCTGG CCATGACCAT GGGCACCGGC TCGTCGAGCA TGCCCTTCAT CATGCCGGCG GTGGAGAACA TCAACGGCCA GGCCATCACC CTGCACAACA AGCACAAGGA CAAGCGCGAC CCCAGCCAGT GGAAGGGCTT CCGCTTCGCC GTGCCCTTCG ACTTCTCCAT GCACAACTTC CTGCTGCGCT ACTACGTCGC CGAGCACGGC CTGGACCCGG ACCGGGATAT CCAGATTCGC GTGCTGCCGC CGCCGGAGAT GGTGGCCAAC CTGCGCGCCG GCAACGTGGA CGGCTACCTG GCCCCCGACC CCTTCAACCA GCGGGCGGTC TGGGAGGAGG TGGGTTTCAT CCACCTGCTC ACCAAGGAGA TCTGGGACGG CCACCCCTGC TGCGCCTTCG CCACCAGCCG CGCCTTTGCC GAGGAGTACC CGAACAGCTT CGGTGCCCTG TTCAAGGCCA TCGTGGACGC CACCCACTAT GCCTCCGAGC ACGAGAACCG GGCCGAGATC TCCGAGGCCA TCGCCCCGCG CAACTACCTG AACCAGCCGG TATCGGTGAT TCAGCAGGTG CTGACTGGCC GCTACGCCGA CGGGCTGGGC AACGTCAAGG AGGACCCGGA CCGGATCGAC TTCGACCCCT TCCCCTGGCA CTCCATGGCG GTGTGGATCA TGACCCAGAT GAAGCGCTGG GGTTACGTGG ACGGCGACGT GGACTACAAG GGCATCGCCG AAGAGGTCTA CCTGGCCACC GACTGCGGCA AACTCATGCG CGAGCTGGGC TACGAGCCGC CGGAGGTCAC CTACAAGAGC CACATGATCA TGGGCAAGGC GTTCGACCCT GAGCAGCCGG AGGCCTACGT GGACAGCTTC GAGATACGGA GGTCGTAA
|
Protein sequence | MSNRSLNDPF NPEADLRHGA GCTCSSCGGG EHAQHDHGAA PAEQDANQML AAKQGVDKEA MFDRAVESAV VRALFGHHDA SRRFFLKAVG AGTFAAAVGS IFPLDAAKAM LKDNLGDPEK RDLTVGFVPI TCATPIIMAH PMGFYERYGL NVDLRSTAGW AVARDMSMNR EYDASHMLTP MPLAMTMGTG SSSMPFIMPA VENINGQAIT LHNKHKDKRD PSQWKGFRFA VPFDFSMHNF LLRYYVAEHG LDPDRDIQIR VLPPPEMVAN LRAGNVDGYL APDPFNQRAV WEEVGFIHLL TKEIWDGHPC CAFATSRAFA EEYPNSFGAL FKAIVDATHY ASEHENRAEI SEAIAPRNYL NQPVSVIQQV LTGRYADGLG NVKEDPDRID FDPFPWHSMA VWIMTQMKRW GYVDGDVDYK GIAEEVYLAT DCGKLMRELG YEPPEVTYKS HMIMGKAFDP EQPEAYVDSF EIRRS
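The sequences above are printed in blocks of ten characters, so the spaces have to be stripped before any computation. The following is a minimal sketch of how the GC content and the protein translation could be recomputed from the gene sequence. It makes two assumptions: the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand), and internal codons under translation table 11 map to the same amino acids as the standard code. The helper names are hypothetical.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of the genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon is ignored.
    """
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence from this record.
print(translate("ATGAGCAACC GATCACTCAA"))  # MSNRSL
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields of the record.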
|
| |