Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1973 |
Symbol | |
ID | 3831155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2057287 |
End bp | 2058294 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829904 |
Product | NLPA lipoprotein |
Protein accession | YP_430814 |
Protein GI | 83590805 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000524772 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTGA TGCCAAGAAC TGTTACCCTG ATTTTGGTCC TAACGCTGAC CGCCGGCCTT CTGGCCGGCT GCGGTTCCCA AAAAACCGCA ACTTCTGGCG AAAAACCCGT AATCAAAATA GGCTACCTGC CCATTACCCA TTCCCTTCCA CTGGTAGTGG CCGATGCCCG GCATAAATCC GACTTTAACA ACTTCCGGCT GGAGCTGGTT AAATTCGGCT CCTGGCCGGA TCTTACTGAG GCCTTAAATT CCGGCCAGAT CCAGGGAGCT ATCACCATGC TGGAGCTGGC TCTGGCAAGT AAGGCCAAAG GTATTCCCGT TGAAGTAGTG TTGTTAAGCC ACAAAAACGG TGATGTTCTG GTGGCCGCCC CTCCGATTAA AGATGTAAAA GATTTAAAAG GAAAAAGGGT GGCCATCCCC CACCGCCTGT CCGGTCATAA TATTCTTTTA TACAAGGCCC TGCAGCAGGC GGGCCTGGCT TACAGCGACG TCCAGGAGGT AGAAATGGCC CCGCCGGAAA TGGCCGCCGC CCTGGCCAGG GGCGAAGTGG CGGCCTACGT GGTAGCCGAA CCCTTCGGCG CCCAGGCAGT GGTAGCTGGC ACGGGACGGG TACTAAAACG GGCCCAAGAT ATAATCCCAG GCTGGGAGTG CTGTGGCCTG GTTATAAACC AGCAGTTGGT CAGGGAAAAT CCGGCCGCCG TCCAGGAACT GGTCGGCAGC CTGGTAGACA CCGGTCATTA TATAATGAGC GATCGCAGGA CGGCCATCGA AATGGCCCGG CCGTATATAC CAGTGGCCAG GGAAACCTTG GAGCAGTCCC TGCAGTGGAT CGATTATAGC GATCTCATGC CTACAACAGA GGGACTGGCC AGGATCGAAC AGTACCTCAA AGAAATACCC TGGGATGGCC AGCCGGGACG CCTGTTACCT GGTGGAGAAA TTAAACTGGA AGAACTGGTC GACGACCGCT TCGCCCGGCA GGCCATATTA CCTACGCCGA AAAATTGA
|
Protein sequence | MKVMPRTVTL ILVLTLTAGL LAGCGSQKTA TSGEKPVIKI GYLPITHSLP LVVADARHKS DFNNFRLELV KFGSWPDLTE ALNSGQIQGA ITMLELALAS KAKGIPVEVV LLSHKNGDVL VAAPPIKDVK DLKGKRVAIP HRLSGHNILL YKALQQAGLA YSDVQEVEMA PPEMAAALAR GEVAAYVVAE PFGAQAVVAG TGRVLKRAQD IIPGWECCGL VINQQLVREN PAAVQELVGS LVDTGHYIMS DRRTAIEMAR PYIPVARETL EQSLQWIDYS DLMPTTEGLA RIEQYLKEIP WDGQPGRLLP GGEIKLEELV DDRFARQAIL PTPKN
|
| |