Gene Information |
Locus tag | Csal_1466 |
Symbol | |
ID | 4029180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1659872 |
End bp | 1661662 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637966649 |
Product | von Willebrand factor, type A |
Protein accession | YP_573518 |
Protein GI | 92113590 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03503] conserved hypothetical protein TIGR03503 |
|
|
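The length fields in the table above can be cross-checked from the coordinates. The sketch below is a minimal check, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1659872
end_bp = 1661662

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1791
print(protein_length)  # 596
```

Under these assumptions the results agree with the reported Gene Length (1791 bp) and Protein Length (596 aa).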
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGCGC TGTTCGCCTT ATTGCTGGTA TGCGTCGGCG GTCTGATGTC GTCGCCGGGA TGGACGCAAG ACGCGACGCC CCCGGATGTG CGCATGATTT TCGACGTCTC GGGCAGCATG AAGGCCAACG ATCCCGCCAA CCTGCGTGCC TCGGCGCTGC AGCTGGCGGC GGCGCTGCTG CCGTCTCAGG CGCGCGGTTC GGTCTGGACG TTCGGTACGC AGGTGCGCAA CCCCTTGCCC GACGGCAAGG TGGATGCCGA GTGGCGGCGG CGTGCGCGCT CGCTGTCGCC GCAACTGGTC GATTACCAGC AGTTCACCGA TATCGAGCAG GCGCTGCGCG AGGCCAGCCA GGCAGCGGGC GGCAAGCGCC ACGTGATCCT GCTGACCGAT GGCATGGTCG ACCTGCCCGG GAGCGGCGAG GTGAAGCGCA AGCGTGATGC GGCCTCGCGC GAGACCTTGA TTGCCTCGCT GGCGCCCGAG CTGGCCACTC AGGACGTGGT GGTGCATACC ATTGCACTCT CGCGCAATGT GGACCGCGAC CTGCTCGAGC GCGTGTCGCA GTCCACCGAC GGCCTGGCAG CAGTGGCCGA GACGCCGGAA GAACTGCTGC GCGCCTTTCT CGATGTCCTC GAGCGCATCG TGCCCAGCGA CCGGGTGCCG CTGGAGGACG GCCGCTTCGA CATCGACCCC GAGGTTGACG GTTTCAACGC CTTGCTTTTC CACGACCAGG ACGCGCCCGG TGCCACGCTG GTGGGCCCGG ATGGCGAGCG TTACACCCGC GATGATCATC CCGACGACAT TCTCTGGCAG AGCATGCCGC GTTACGACCT GATCCGGGTG CCCGATCCCG CTGCAGGGGA ATGGCATGTC GAAGGCCAGG TGGGCGACAG TCGGGTCACC GTCGAGTCGC CGCTGACGTT GCGCACCGAG ACCATGCCGA CGTCCTTGTA CCTCGGCTTC GACACGCCGC TCGAGGCCTG GTTGTCACGC GAGGGCGAGA CACTCGTGGG CGATGCCATG CCCGAGGGCA TGCGGATGCG CGCCGAGCTG CGCGATCTGG ACGACGCGAC GCTGTCGTCG ACGACGCTGA CCGCGGGAGA GGACGGCCAT TTCACCGGAA CCCTGCCGGC GCCCGAGCAG GAGGGCAACG CGCGTCTGGT GGTGACCGCC GAAGGCCCCA CGCGCGTGCG CGAGCGCGTT CAGGGGGTCA ATGTCATCCC TCCCTTGGCG GCTTCCCTGA ACGACGATGC CACCACGGTC ATCCTCGAGG CCCAGCACCC GCGACTCGAC GCCGATAACA CCCGGGTGTC GTCGAGCGTG CTGGGCGAGT CGCTGGACGT GGAACCGGTC GGGCCGAAGA CATGGCACAT TGCATTGCCC GACCTCGATC CGCATCAGTC GGTCCCGCTG GAACTCACGC TCGAGGTGAC GCTGGATGGT CGTACCTGGT CGATTCGCCT TCCCGCGCTG CGCCTCAATC CCGATGCGCG CATCGGGCTT TCCGGCGCCG ATGTCGGTGC CGCCCCGTCA GCGGAGGCCT TGCCGGACAC CGGCGAGGAA CGCACGGAAA GCGCCGAGGA GGGCGAGCGC AACCTGAGCG AGATGGCGGG TGCTTTATGG CACAAGGCCG GTGATGAATG GCAGGCGTTG AAGCCGCATG TCGCCCCTTA TGCCAAGCGT CCGGCGACCT GGGCGGCGTT GGCCGCGCTG TTGCTGGCGG TAGTGGTACT GTCGGTCATG CGCCGGCGTG CCCGGCGCCG TCGGCGGCGT CGCCGCGAAC CGCACGTCTA G
|
Protein sequence | MRALFALLLV CVGGLMSSPG WTQDATPPDV RMIFDVSGSM KANDPANLRA SALQLAAALL PSQARGSVWT FGTQVRNPLP DGKVDAEWRR RARSLSPQLV DYQQFTDIEQ ALREASQAAG GKRHVILLTD GMVDLPGSGE VKRKRDAASR ETLIASLAPE LATQDVVVHT IALSRNVDRD LLERVSQSTD GLAAVAETPE ELLRAFLDVL ERIVPSDRVP LEDGRFDIDP EVDGFNALLF HDQDAPGATL VGPDGERYTR DDHPDDILWQ SMPRYDLIRV PDPAAGEWHV EGQVGDSRVT VESPLTLRTE TMPTSLYLGF DTPLEAWLSR EGETLVGDAM PEGMRMRAEL RDLDDATLSS TTLTAGEDGH FTGTLPAPEQ EGNARLVVTA EGPTRVRERV QGVNVIPPLA ASLNDDATTV ILEAQHPRLD ADNTRVSSSV LGESLDVEPV GPKTWHIALP DLDPHQSVPL ELTLEVTLDG RTWSIRLPAL RLNPDARIGL SGADVGAAPS AEALPDTGEE RTESAEEGER NLSEMAGALW HKAGDEWQAL KPHVAPYAKR PATWAALAAL LLAVVVLSVM RRRARRRRRR RREPHV
|
| |
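The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short prefix is used only as a demonstration input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, used only as a short demo.
# Paste the full gene sequence here to compare with the reported GC content.
prefix = "ATGCGAGCGC TGTTCGCCTT ATTGCTGGTA"
print(clean_sequence(prefix))
print(round(gc_fraction(prefix), 2))
```

The same `clean_sequence` helper applies to the protein sequence, whose joined length can then be compared with the Protein Length field.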