Gene Information |
Locus tag | TM1040_1769 |
Symbol | |
ID | 4076798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1860620 |
End bp | 1861903 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638007084 |
Product | VWA containing CoxE-like |
Protein accession | YP_613764 |
Protein GI | 99081610 |
COG category | [R] General function prediction only |
COG ID | [COG3552] Protein containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
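The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1860620
end_bp = 1861903

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1284 427
```

Both results agree with the Gene Length (1284 bp) and Protein Length (427 aa) fields.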
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.606293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTTGAAT ACCCCTCGCT TGCGATCTCG GACGATCCCA AACTCGCCGC GAATATCACC CATTTTGCGC GAGCTCTGCG CAAGGCCGGA CTCAATGTTG GTACCGGGCG GGTGCTCGAT GCGATCAGGG CTGTTGAGGC CGCCGGTTTC ACGTCGCGGC GCGATTTTTA CTGGACCTTG CACGCGTGTT TTGTCTCCCG TCCCGAAGAG CGCGTGGTCT TTGGGCAGGT GTTTCGTCTC TTCTGGCGTG ACCCCCGGTT TCTGGAACAT ATGATGGCGG CGATGCTGCC CGCGATCCGA GGCGTACAGC AGGAACGCGC GGCCAAACCT GCCGAGACCC GTGCTGCCGA GGCATTGCTG GATGGGCAAC TGCCCGAACA TCCCGAGGAA TCACCCGAGG CCACGGACGA AGCAGAGGAG ATCGAAATCG ACGCCGCAAT GACCCTCTCG GCCGAAGAGC GGCTCAAGAC GCTTGATTTT GAACAGATGA CCACCGAGGA AATCCAGCAG GCCAAGCGGA TGCTTGCCAC ACTGAGATTG CCGATTGCGC CTCTGAAAAC GCGCCGCCAT CAACCCGCGC CGCAAGGTGC AAGGCCGGAT TGGCGGCGCA CGATGCGTGG TGCGGGCCGC ACCGGGGGCG AAATCGCGCG CATCGCCCGC AGCAAACGCG CCGAGCGGTT TCCAAATCTT GTGGTTCTCT GCGACATTTC CGGCTCCATG AGCCAGTACA GCCGCATGGT GCTGCATTTT CTGCATGCGG TCGCCAATCG ACCCGCTGAC GGGCGGCAGG GGCGCTGGGC GCAGGTGCAT GGTTTCACCT TCGGCACCCG GCTCACCAAT ATCTCGCGAC ATCTGAAGCA ACGCGATGTG GATGCGGCGC TCGCGGCGGC GGGTGCCGAG GCGCAGGACT GGCAGGGAGG GACGCGGATC GGTGGGTGCC TGCATGCCTT CAACCGGGAC TGGTCGCGCC GAGTCATGGG GCAGGGGGCG GTGGTGCTCC TGGTAAGTGA TGGGCTGGAC CGCGACGTCC CAGAGACCCT GGCGCTGGAG ATGCAGCGCC TGCGGCTTTC TGCAGGGCGT CTTGTCTGGC TCAACCCTTT GCTCCGGTGG GATGGGTTCC TGCCACGCGC CCGCGGCATC CAGGCGATGC TGCCCCATGT GGACAGCTTT CGCGCAGGTC ACAATATTGC GTCGCTTGAA GATCTTGCAC AAGCGCTCTC GCGGCCGGAT GACACTGGCG AAAAGCTGCG CCTGATGGCC ATGATGCAGG AGGAACGCGC GTGA
Protein sequence | MVEYPSLAIS DDPKLAANIT HFARALRKAG LNVGTGRVLD AIRAVEAAGF TSRRDFYWTL HACFVSRPEE RVVFGQVFRL FWRDPRFLEH MMAAMLPAIR GVQQERAAKP AETRAAEALL DGQLPEHPEE SPEATDEAEE IEIDAAMTLS AEERLKTLDF EQMTTEEIQQ AKRMLATLRL PIAPLKTRRH QPAPQGARPD WRRTMRGAGR TGGEIARIAR SKRAERFPNL VVLCDISGSM SQYSRMVLHF LHAVANRPAD GRQGRWAQVH GFTFGTRLTN ISRHLKQRDV DAALAAAGAE AQDWQGGTRI GGCLHAFNRD WSRRVMGQGA VVLLVSDGLD RDVPETLALE MQRLRLSAGR LVWLNPLLRW DGFLPRARGI QAMLPHVDSF RAGHNIASLE DLAQALSRPD DTGEKLRLMA MMQEERA
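The gene and protein sequences above can be checked against each other by translating codons. The following is a minimal sketch, not the database's own pipeline: it translates only the first 30 and last 24 nucleotides of the gene sequence, and it assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TGA even though the strand is "-"). Translation table 11 assigns the same amino acids to codons as the standard code, so a standard codon table is used; the helper name `translate` is hypothetical.

```python
# Build the standard codon table (same amino-acid assignments as table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

def translate(seq: str) -> str:
    """Translate a coding DNA string; spaces between 10-mers are ignored."""
    dna = "".join(seq.split()).upper()
    # Walk the sequence codon by codon, dropping any incomplete trailing codon.
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - len(dna) % 3, 3)
    )

# First three 10-mers and the last 24 nucleotides of the gene sequence.
head = "ATGGTTGAAT ACCCCTCGCT TGCGATCTCG"
tail = "ATGATGCAGG AGGAACGCGC GTGA"

print(translate(head))  # prints: MVEYPSLAIS
print(translate(tail))  # prints: MMQEERA*
```

The translated fragments match the start (MVEYPSLAIS) and end (MMQEERA) of the protein sequence, with `*` marking the TGA stop codon.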