Gene Csal_1466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1466 
Symbol 
ID4029180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1659872 
End bp1661662 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content68% 
IMG OID637966649 
Productvon Willebrand factor, type A 
Protein accessionYP_573518 
Protein GI92113590 
COG category 
COG ID 
TIGRFAM ID[TIGR03503] conserved hypothetical protein TIGR03503 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGCGC TGTTCGCCTT ATTGCTGGTA TGCGTCGGCG GTCTGATGTC GTCGCCGGGA 
TGGACGCAAG ACGCGACGCC CCCGGATGTG CGCATGATTT TCGACGTCTC GGGCAGCATG
AAGGCCAACG ATCCCGCCAA CCTGCGTGCC TCGGCGCTGC AGCTGGCGGC GGCGCTGCTG
CCGTCTCAGG CGCGCGGTTC GGTCTGGACG TTCGGTACGC AGGTGCGCAA CCCCTTGCCC
GACGGCAAGG TGGATGCCGA GTGGCGGCGG CGTGCGCGCT CGCTGTCGCC GCAACTGGTC
GATTACCAGC AGTTCACCGA TATCGAGCAG GCGCTGCGCG AGGCCAGCCA GGCAGCGGGC
GGCAAGCGCC ACGTGATCCT GCTGACCGAT GGCATGGTCG ACCTGCCCGG GAGCGGCGAG
GTGAAGCGCA AGCGTGATGC GGCCTCGCGC GAGACCTTGA TTGCCTCGCT GGCGCCCGAG
CTGGCCACTC AGGACGTGGT GGTGCATACC ATTGCACTCT CGCGCAATGT GGACCGCGAC
CTGCTCGAGC GCGTGTCGCA GTCCACCGAC GGCCTGGCAG CAGTGGCCGA GACGCCGGAA
GAACTGCTGC GCGCCTTTCT CGATGTCCTC GAGCGCATCG TGCCCAGCGA CCGGGTGCCG
CTGGAGGACG GCCGCTTCGA CATCGACCCC GAGGTTGACG GTTTCAACGC CTTGCTTTTC
CACGACCAGG ACGCGCCCGG TGCCACGCTG GTGGGCCCGG ATGGCGAGCG TTACACCCGC
GATGATCATC CCGACGACAT TCTCTGGCAG AGCATGCCGC GTTACGACCT GATCCGGGTG
CCCGATCCCG CTGCAGGGGA ATGGCATGTC GAAGGCCAGG TGGGCGACAG TCGGGTCACC
GTCGAGTCGC CGCTGACGTT GCGCACCGAG ACCATGCCGA CGTCCTTGTA CCTCGGCTTC
GACACGCCGC TCGAGGCCTG GTTGTCACGC GAGGGCGAGA CACTCGTGGG CGATGCCATG
CCCGAGGGCA TGCGGATGCG CGCCGAGCTG CGCGATCTGG ACGACGCGAC GCTGTCGTCG
ACGACGCTGA CCGCGGGAGA GGACGGCCAT TTCACCGGAA CCCTGCCGGC GCCCGAGCAG
GAGGGCAACG CGCGTCTGGT GGTGACCGCC GAAGGCCCCA CGCGCGTGCG CGAGCGCGTT
CAGGGGGTCA ATGTCATCCC TCCCTTGGCG GCTTCCCTGA ACGACGATGC CACCACGGTC
ATCCTCGAGG CCCAGCACCC GCGACTCGAC GCCGATAACA CCCGGGTGTC GTCGAGCGTG
CTGGGCGAGT CGCTGGACGT GGAACCGGTC GGGCCGAAGA CATGGCACAT TGCATTGCCC
GACCTCGATC CGCATCAGTC GGTCCCGCTG GAACTCACGC TCGAGGTGAC GCTGGATGGT
CGTACCTGGT CGATTCGCCT TCCCGCGCTG CGCCTCAATC CCGATGCGCG CATCGGGCTT
TCCGGCGCCG ATGTCGGTGC CGCCCCGTCA GCGGAGGCCT TGCCGGACAC CGGCGAGGAA
CGCACGGAAA GCGCCGAGGA GGGCGAGCGC AACCTGAGCG AGATGGCGGG TGCTTTATGG
CACAAGGCCG GTGATGAATG GCAGGCGTTG AAGCCGCATG TCGCCCCTTA TGCCAAGCGT
CCGGCGACCT GGGCGGCGTT GGCCGCGCTG TTGCTGGCGG TAGTGGTACT GTCGGTCATG
CGCCGGCGTG CCCGGCGCCG TCGGCGGCGT CGCCGCGAAC CGCACGTCTA G
 
Protein sequence
MRALFALLLV CVGGLMSSPG WTQDATPPDV RMIFDVSGSM KANDPANLRA SALQLAAALL 
PSQARGSVWT FGTQVRNPLP DGKVDAEWRR RARSLSPQLV DYQQFTDIEQ ALREASQAAG
GKRHVILLTD GMVDLPGSGE VKRKRDAASR ETLIASLAPE LATQDVVVHT IALSRNVDRD
LLERVSQSTD GLAAVAETPE ELLRAFLDVL ERIVPSDRVP LEDGRFDIDP EVDGFNALLF
HDQDAPGATL VGPDGERYTR DDHPDDILWQ SMPRYDLIRV PDPAAGEWHV EGQVGDSRVT
VESPLTLRTE TMPTSLYLGF DTPLEAWLSR EGETLVGDAM PEGMRMRAEL RDLDDATLSS
TTLTAGEDGH FTGTLPAPEQ EGNARLVVTA EGPTRVRERV QGVNVIPPLA ASLNDDATTV
ILEAQHPRLD ADNTRVSSSV LGESLDVEPV GPKTWHIALP DLDPHQSVPL ELTLEVTLDG
RTWSIRLPAL RLNPDARIGL SGADVGAAPS AEALPDTGEE RTESAEEGER NLSEMAGALW
HKAGDEWQAL KPHVAPYAKR PATWAALAAL LLAVVVLSVM RRRARRRRRR RREPHV