Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1378 |
Symbol | |
ID | 5833456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1546630 |
End bp | 1547727 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641367178 |
Product | hypothetical protein |
Protein accession | YP_001638850 |
Protein GI | 163850807 |
COG category | [S] Function unknown |
COG ID | [COG2847] Uncharacterized protein conserved in bacteria [COG4549] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0591071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00000407941 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCAGACA ACCGGAACCG TCCCATGCCT CGCCTCTGCC TCAGCGCCCC GCTTCGCGCC GCCCTGCCGC TCGTCCTGAT CGGCTCAAGC GCTGTTCTTT GCGCCGTCTT CGGCCTCGTC TCGCCCGCCT CGGCCCATGC CGTGCTGGAG CGCAAGGAGG CCGCGCCGAA CGCTGCCTAT CGCGGCGTCG TCCAGATCAT GCACGGCTGC GATGGCCGGC CGACCACCCG CATCAGCGTC ACCATCCCTG AGGGCGTGAC CGGGGCCAAG CCGATGCCGA AGCCCGGCTG GACGATCGAA ACCGTCAAGT CGGCCTATGC CCGGTCCTAC CCGTCCTTTC ACGGACAGGT CTCGGAGGGC GTCACGAAAA TCACCTGGAG CGGCGGCAGC CTGCCGGACG AGCAGATCGA CGAGTTCACC TTCTTCGCCC GGATTTCCGA CGCCTTCGCG CCGGGCGCGA CGATCTACTT CCCCGTCGAG CAGGACTGCA CCGAGGGCAG CTACCGCTGG AGCGATGTTC CGGCCGAGAA CGGAAACGCG CAGGCCCTGA AGGCGCCGGC ACCGGCGGTG CGGATCATCG CCGCGCAAGG AACGGCGCCC CAGGCAACGC CGACCCAAGC AACGACGACG CAGGCTTCGG CCGCCCCCGC CGCGAAAACC GGAGCGATCG CGATCGAGAC GCCTTGGCTG CGGGCGACAC CGGGCGGCGC GAAGGTTGCC GGCGGCTACG TGACTTTGCG CAACACCGGC ACCGAGCCCG ACCGGCTGAC GGGCGCCGCG ATCCCCCAGG CGGGCCGCGC CGAGATCCAC TCGATGACGA CGGAGGGCGG CGTGATGAAG ATGGCGCCCG TCGAGGGCGG TCTTGCCCTT GCGCCGGGGG CCGGCGTCGC GCTCAAGCCC GGCGGCTACC ACCTGATGTT CCTCGACCTG AAGGACGGGC TGAAGGCGGG CGAGACCATC GCGGGCACGC TGACCTTCGA GCGGGCCGGA ACGGTGCCGG TGACCTTCAC CGTGGCACCG ATCGGCGCGC AAGGACCCGA CAGGCAAGGA CCCGGCGCCA CCGCGCTCGA GGCCGACGGC CACAAGCACC ACCACTGA
|
Protein sequence | MSDNRNRPMP RLCLSAPLRA ALPLVLIGSS AVLCAVFGLV SPASAHAVLE RKEAAPNAAY RGVVQIMHGC DGRPTTRISV TIPEGVTGAK PMPKPGWTIE TVKSAYARSY PSFHGQVSEG VTKITWSGGS LPDEQIDEFT FFARISDAFA PGATIYFPVE QDCTEGSYRW SDVPAENGNA QALKAPAPAV RIIAAQGTAP QATPTQATTT QASAAPAAKT GAIAIETPWL RATPGGAKVA GGYVTLRNTG TEPDRLTGAA IPQAGRAEIH SMTTEGGVMK MAPVEGGLAL APGAGVALKP GGYHLMFLDL KDGLKAGETI AGTLTFERAG TVPVTFTVAP IGAQGPDRQG PGATALEADG HKHHH
|
| |