Gene Information |
Locus tag | Mext_4207 |
Symbol | |
ID | 5833243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4681821 |
End bp | 4682861 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641369997 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001641647 |
Protein GI | 163853604 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
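The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(4681821, 4682861)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1041 346"
```

Both results match the Gene Length and Protein Length rows, which suggests the inclusive-coordinate assumption holds for this record.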
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.532408 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTGC CAATCACTTT CAATAATACG ACGTATGCTC CCGGATTTCT AGGAACCGAC GACGGTGGTG CATCTGGAAA CTTTCAGGTC GATACCGCCT CGACCTATTC CGTCACAGTA TCAGGGACCA TCAACGCTGT CGGTGATCCC GTCACTCTGA CCTATGGAGC CGATGCTCCC GCCGGTTTTG CAGGCACGTC CGTTCAGTTG ACCTCGACGC AGTTCGACAA TTCAGGCCAG ATCCTGTTCG TGAGCAGGGC CATCCCGCCC GGTGAGACGG AGACCGGCAA CTACCGCTAC CTCCTCTCGA ACACCCAGGT GGTCGGCTCG AACCCGCCTC CCGGCTCCAC CCGGACCCGC TTCCTCGCCG ACGGCAACAA CACGGCCGGC GATTACAACG TCCAGGCCGC GCCCTGCTTC ACCACGGGCA CGCTCATCCG CACGGCTCGC GGCGAGGTGG CGGTCGAGGA TCTGATTGTC GGCGATCTCG CCGTGACGGC TTCCGGCACG CTGCGTCCGA TCACCTGGAT CGGCAACCGC GCCCTCGATG CCAAGGGCGA GGCGCTGCCC CACAACGAGC AGCCCATCCG GATCCGCGCG GGTGCCTTCG GCCCCGGCCT CCCGGCGCGC GATCTGCGCC TCTCGCACGG CCATCCGGTG CTCGTCGGCG CCGATGCCAA CGGCGAGGGC GGCGTGCTGG TGCCCGTGAT GTGCCTGATC AACGGCACCT CCGTCCTCCG CGAGCCGGCG ACGCAGGTGA CCTACTGGCA TATCGAGCTG GATGCGCACG ACATCCTGCT CGCCGAAGGT CTGGCCGCCG AGAGCTACTA CGACATGGGC AGCCGCGTTT GGTTCGCCGG CGAGGACGGC ATGCTGACCG ATCCGGACTT CGTGCCGGCC TGCGAGCACG GCCGCTGCCG CCCTGTGGCG GTGGACGGCG CCCTCGTGGA CGGTGAGCGG CAGCGGCTCG ACGGCGTCTT CGCCGCGGAG CTCGATGGGC ACAGCGCCTG GGCCGACGCA CCGGTGTGGC ACGCCGCGTA A
|
Protein sequence | MALPITFNNT TYAPGFLGTD DGGASGNFQV DTASTYSVTV SGTINAVGDP VTLTYGADAP AGFAGTSVQL TSTQFDNSGQ ILFVSRAIPP GETETGNYRY LLSNTQVVGS NPPPGSTRTR FLADGNNTAG DYNVQAAPCF TTGTLIRTAR GEVAVEDLIV GDLAVTASGT LRPITWIGNR ALDAKGEALP HNEQPIRIRA GAFGPGLPAR DLRLSHGHPV LVGADANGEG GVLVPVMCLI NGTSVLREPA TQVTYWHIEL DAHDILLAEG LAAESYYDMG SRVWFAGEDG MLTDPDFVPA CEHGRCRPVA VDGALVDGER QRLDGVFAAE LDGHSAWADA PVWHAA
|
| |
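The sequences above are displayed in groups of 10 characters, so the spaces need to be removed before any computation. The following is a minimal sketch of how one might clean a sequence, compute its GC percentage, and check the start and stop codons. The helper names are hypothetical, and the start check only accepts ATG even though translation table 11 also permits alternative start codons.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces that split the sequence into 10-base groups; uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if seq begins with ATG and ends with TAA, TAG, or TGA.

    Simplification: alternative start codons allowed by table 11 are not accepted.
    """
    return seq.startswith("ATG") and seq[-3:] in {"TAA", "TAG", "TGA"}


# First two and last two groups of the gene sequence in this record,
# used here for illustration instead of the full 1041 bp string.
head = clean_sequence("ATGGCTTTGC CAATCACTTT")
tail = clean_sequence("ACGCCGCGTA A")
print(len(head), has_start_and_stop(head + tail))
```

Passing the full Gene sequence value through `clean_sequence` and then `gc_percent` gives a figure that can be compared against the GC content row.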